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O ■ ABSTRACT 

L| ' The UKIDSS Galactic Plane Survey (GPS) is one of the five near infrared Public 

. !^ \ Legacy Surveys that are being undertaken by the UKIDSS consortium, using the Wide 

. Field Camera on the United Kingdom Infrared Telescope. It is surveying 1868 deg^ 

Vh ' of the northern and equatorial Galactic plane at Galactic latitudes —5° < 6 < 5° in 

5^ \ the J, H and K filters and a ~ 200 deg^ area of the Taurus-Auriga-Perseus molecular 

cloud complex in these three filters and the 2.12 /xm (1-0) H2 filter. It will provide 
data on ~ 2 X 10^ sources. Here we describe the properties of the dataset and provide 
a user's guide for its exploitation. We also present brief Demonstration Science results 
from DR2 and from the Science Verification programme. These results illustrate how 
GPS data will frequently be combined with data taken in other wavebands to produce 
scientific results. The Demonstration Science comprises six studies. (1) A GVS-Spitzer- 
GLIMPSE cross match for the star formation region G28.983-G.603 to identify YSOs. 
This increases the number of YSOs identified by a factor of ten compared to GLIMPSE 
alone. (2) A wide field study of the M17 nebula, in which an extinction map of the 
field is presented and the effect of source confusion on luminosity functions in different 
sub-regions is noted. (3) H2 emission in the p Ophiuchi dark cloud. All the molecular 
jets are traced back to a single active clump containing only a few protostars, which 
suggests that the duration of strong jet activity and associated rapid accretion in 
low mass protostars is brief. (4) X-ray sources in the Nuclear Bulge. The GPS data 
distinguishes local main sequence counterparts with soft X-ray spectra from Nuclear 
Bulge giant counterparts with hard X-ray spectra. (5) External galaxies in the Zone 
of Avoidance. The galaxies are clearly distinguished from stars in fields at longitudes 
Z>90°. (6) IPHAS-GPS optical-infrared spectrophotometric typing. The (i'-J) vs.(J- 
H) diagram is used to distinguish A-F type dwarfs, G dwarfs, K dwarfs and red clump 
giants in a field with high reddening. 
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1 INTRODUCTION 

The United Kingdom Infrared Deep Sky Survey (UKIDSS) 
is a suite of five public surveys of varying depth and area 
coverage which began in May 2005 and are expected to con- 
tinue until 2012. These surveys all use the Wide Field Cam- 
era (WFCAM, see Casali et al.2007) on the United Kingdom 
Infrared Telescope (UKIRT). The design and the principal 
aims of the five UKIDSS surveys are described by Lawrence 
et al. (2007), along with a brief example of data from each 
survey. The data are reduced and calibrated at the Cam- 
bridge Astronomical Survey Unit (CASU) using a dedicated 
software pipeline (Irwin et al.2008) and are then transferred 
to the WFCAM Science Archive (hereafter the WSA, Ham- 
bly et al.2008) in Edinburgh. There, the data are ingested 
into a sophisticated SQL database and detections in the dif- 
ferent passbands are merged. 

The UKIDSS data are released as SQL databases at 
intervals of six to twelve months. The area coverage and data 
quality of these releases are described by Dye et al.(2006, 
the Early Data Release), Warren et al. (2007a, the First Data 
Release) and Warren et al. (2007b, the Second Data Release). 
In addition to the major releases, smaller batches of pipeline 
processed images and their associated FITS binary source 
catalogues for individual passbands are released on a faster 
timescale, but with no quality control. The raw data are 
available through CASU and the ESQ archive. The UKIDSS 
data are presently public to all professional astronomers in 
any ESQ member state. World access to the various data 
releases begins 18 months after each data release to ESOQ 

The aim of this paper is to provide a guide to the GPS 
database in the WSA. We describe the properties of the 
data in detail, illustrating usage of the WSA tools and giving 
examples of how to combine GPS data with data from other 
wavebands for maximum scientific benefit. This should be 
regarded as essential reading for those wishing to exploit the 
data. This paper describes the main GPS-specific aspects of 
the UKIDSS database in one place. We note that neither 
the pipeline nor the WSA were designed explicitly for the 
GPS but rather for the UKIDSS surveys as a whole. Some 
compromises were made to produce an efficient database in 
a reasonable period of time and we describe their efi'ects 
here. 

The paper is structured as follows. In §2 we summarize 
the survey design, including recent changes from the original 
plan described by Lawrence et al. In §3 we describe the prop- 
erties of the GPS dataset (depth and stellar populations) as 
a function of Galactic location and source confusion. In §4 
we present Demonstration Science results from the Science 
Verification programme and from DR2 that are designed to 
illustrate the variety of uses of GPS data, in combination 
with data from other wavebands. Finally, Appendix A con- 
tains a detailed guide to the WSA for GPS science. The 
guide illustrates how to use the parameters available in the 
GPS database to best effect and describes the strengths and 
weaknesses of the data that are presently available. 

^ Users can obtain a WSA account through their local Com- 
munity Administrator at their home institution. If the insti- 
tution has no UKIDSS community then they should contact 
the Consortium Survey Scientist, currently Steve Warren (email: 
s.j.warren@ic.ac.uk) 



2 SUMMARY OF GPS STRATEGY, AREA 
AND INTEGRATION TIME 

The GPS is designed to cover the entire northern and equa- 
torial Galactic plane that is accessible to UKIRT in a 10 
degree wide band around the sky. There is also a narrower 
southern extension to the Galactic Centre and a 200 deg^ 
survey of the Taurus- Auriga-Perseus (TAP) molecular cloud 
complex, which lies off the plane and follows the contours 
of the ^"^CO gas mapped by Ungerechts & Thaddeus (1987). 
The southern extension is narrower because of its poorer 
accessibility from Mauna Kea. In Galactic coordinates, the 
in-plane region is: 141 </ < 230, -5 < 6 < 5; 15 < Z < 107, 
-5 < fo < 5; and -2 < Z < 15, -2 < fe < 2. The section 
of the plane at Z=107 to 141 is not covered because it lies 
north of Dec=-|-60° , a region of the sky that UKIRT cannot 
reach due to fundamental features of the telescope design. 

Observations are made in the J, H and K bandpasses 
with total on-source integration times of 80s, 80s and 40s 
respectively. These integrations are made up of shorter ex- 
posures, employing a sub-pixel dithering strategy to fully 
sample the image profile (see §A1.2). WFCAM has four spa- 
tially separated arrays, so a tiling strategy is employed to 
obtain images of contiguous areas of the sky (see §A1.1). The 
median 5a depths in DR2 given by Warren et al. (2007b) are 
J=19.77, H=19.00, K=18.05 (Vega system). However, the 
survey depth is spatially variable due to source confusion so 
these figures should be used with caution, see $3. Data in 
the three filters are generally collected within an interval of 
20 minutes for any point on the sky, in order to minimise 
the effects of any photometric variability on source colours 
and derived extinction. Exceptions can occur (in DR2 and 
later releases) when data that failed quality control in only 
one or two filters were later repeated in only those filters. 
In these cases there will always be near contemporaneous 
fluxes in at least two filters which can be used to estimate 
intrinsic source colours and extinction. The TAP molecular 
cloud component of the survey also includes observations 
in the 2.12 yLxn (1-0) S(l) II2 filter, with a total on source 
integration time of 160s. 

The data reduction pipeline is described in Irwin et 
al.(2008), so we do not provide a full description here. Some 
of the more significant details that may affect the data in un- 
expected ways are described in §A2. One important aspect 
of the reduction that is worth reiterating here is the pho- 
tometric calibration. The photometric zero points are boot- 
strapped from the 2MASS Point Source Catalogue, which is 
believed to provide a reliable photometric calibration over 
the whole sky (Skrutskie et al.2006). If we assume that the 
2MASS calibration is perfect, the scatter in the derived 
zero points for the numerous 2MASS stars used in each 
WFCAM field indicates that the absolute accuracy of the 
UKIDSS photometric calibration in the J, H and K band- 
passes is 1-2% in photometric conditions and 2-3% in non- 
photometric conditions (Hodgkin et al., 2008, submitted). 
Since the 2MASS J, H and Ks filters are slightly different 
from the WFCAM fihers (see Hewett et al.2006), photo- 
metric transformations are used to correct the zero points. 
From DR2 onward, these transformations include extinction 
terms to correct for the fact that the colours of stars in the 
Galactic plane are infiuenced by interstellar reddening as 
well as temperature and gravity (Warren et al. 2007b). These 
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transformations are included in §A2 for completeness. They 
should be used in preference to the preliminary transforma- 
tions given in Hewett et al.(2006). 

UKIDSS astrometry is also calibrated using numerous 
bright 2MASS stars in each field. The precision of the as- 
trometric solution is typically 0.09 arcsec for GPS data, as 
determined from the r.m.s. residuals of the stars to the fit 
(see §A1.3 for details of how to determine the precision for 
specific fields) . Individual stars can of course have much less 
precise astrometry, owing to source confusion or low signal 
to noise ratio. 

Owing to source confusion the survey depth is spatially 
variable in the more crowded parts of the plane (see §3.1). 
Hence, in many regions of the plane the GPS represents the 
deepest survey possible with ground based, seeing limited 
wide field instrumentation. Deeper images can be obtained 
by using higher resolution cameras with smaller pixels in 
conditions of good seeing, but for the foreseeable future such 
cameras will not be able to cover large areas of the Galactic 
plane. 

The sky coverage in the various data releases so far 
is described in the references given in §1. Those areas 
of the plane at —2 < / < 107 with three band cov- 
erage can be viewed as an interactive three colour mo- 
saic at |http://surveys.roe^.uk:8080/wsa/gps-mos aic.jsp[ 
(credit M.Read) where the finest spatial scale has l"pixels. 

2.1 Changes to the GPS Programme 

In early 2007 there was a review of the UKIDSS project in re- 
sponse to a call for additional large observing proposals. This 
led to changes and reductions in the UKIDSS programme 
in order to accomodate the new observing campaigns. The 
original GPS design (Lawrence et al.2007) also included two 
additional epochs of data in the K band only, which were 
intended to provide a long time baseline for proper motions 
and to detect high amplitude variable stars. Following the 
2007 review, further progress on these K-only scans of the 
survey area has been deferred until at least 2010. Some 500 
deg^ of the plane have already been observed in the K- 
only mode during conditions of thin cirrus cloud (defined 
as <20% drop in throughput), covering regions not yet ob- 
served in the main three band survey. Since the UKIDSS 
photometry is calibrated using numerous 2MASS sources in 
every field there is only a very small loss of precision for rela- 
tive photometry (at the millimagnitude level) even when ob- 
serving through thin cirrus (Hodgkin et al.2008). The main 
JHK survey has been conducted in photometric conditions, 
in order to ensure maximum depth, but thin cirrus condi- 
tions (<10% absorption) may be accepted from late 2008 in 
order to assist timely completion of the survey. 

It was originally planned to observe a region 300 deg^ 
of TAP molecular clouds three times: once in the J, H, K 
and H2 filters and twice more in the K and II2 filters, in 
order to measure the velocities and accelerations of proto- 
stellar jets. However this part of the survey was cancelled 
in the 2007 review of UKIDSS, at which point ~150 deg^ of 
the region had been observed. Nonetheless, the remainder of 
the TAP region will observed in the Z, Y, J, H and K filters 
as part of the ongoing UKIDSS Galactic Clusters Survey, 
which included the same region. Observations of the TAP 
region with the H2 filter are continuing via Directors Dis- 



cretionary Time and small observing programmes, with the 
aim of completing one pass of the denser cloud cores, with a 
total area of ~200 deg^ The TAP area coverage is described 
in more detail in Davis et al.(2008), and the reduced data 
can be viewed at www. jach.hawaii.edu/UKIRT/TAPQ 



3 POPULATIONS AND DATA PROPERTIES 
AS A FUNCTION OF GALACTIC 
COORDINATES 

In this section we present magnitude histograms, colour 
magnitude diagrams and two colour diagrams for represen- 
tative fields throughout the survey region. We discuss the 
survey depth as a function of Galactic coordinates and then 
describe the nature of the main Galactic (and extragalac- 
tic) populations that are detected. The Besangon Galactic 
stellar population modeQ (Robin et al.2003) is used to help 
interpret the diagrams and we then compare some diagrams 
for the synthetic population with the real data. 

Figures 1 to 13 show the results for 12 GPS fields. The 
fields at I <90° are square boxes that are 0.2 degrees on a 
side in I and b. The fields at I >90° are square boxes that 
are 0.5 degrees on a side in I and b, in order to provide 
similar sized samples to reveal the nature of the populations 
in these less crowded regions. All fields include data from at 
least 2 spatially separate WFCAM multiframes and usually 
more. Any errors in global photometric calibration would 
add to the scatter in these plots. However the quality of the 
2MASS-based photometric calibration is such that we see no 
obvious increase in scatter compared to data from individual 
WFCAM arrays. 



3.1 Survey sensitivity and the effect of source 
confusion 

Figure 1 shows the magnitude histograms. Visual inspec- 
tion of the images shows that the fields at I >90° have less 
source confusion than the other fields and source confusion 
is insignificant for the two fields near /=170°. The corre- 
sponding histograms represent the maximum intrinsic depth 
of the survey. We see that the modal depths in uncrowded 
fields are approximately K=17.75 to 18.0, H=18.5 to 18.75, 
J=19.4 to 19.65. The variation is due to the variable see- 
ing, sky brightness, thermal background and atmospheric 
transparency during the survey. These modal depths are 
slightly shallower than the 5cr depths quoted in §2: K=18.05, 
H=19.00, J=19.77. The reasons for this are described in §A3. 
By visually extrapolating the histograms and assuming that 
the decline at faint magnitudes is entirely due to limited 
sensitivity we estimate that the typical 90% completeness 



To gain access to this web page, UKIDSS members should 
request the user name and password from Chris Davis, email: 
c.davis@jach.hawaii.edu. The data become world public 18 
months after release to UKIDSS members. 

^ The Besangon models are available on-line at bison. obs- 
besancon.fr/modele 
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Figure 1. Magnitude histograms in a selection of GPS fields. Thick lines: K band; medium lines: H band; thin lines: J band. A linear 
scale is used rather than the conventional log scale in order to better illustrate the observed variation in modal magnitude between 
fields. Source counts per square degree are given, using bins of width 0.25 magnitudes. A general decline in source density is seen with 
increasing longitude and increasing distance from the mid-plane, though there are some exceptions to this trend. 
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Figure 2. (i, 6)=(-0.3,0). Colour magnitude and two colour diagrams for a 0.2x0.2° field adjacent to the Galactic centre. The two 
colour diagram, like those in figures 3 to 13, shows only sources with the "most reliable, low completeness" selection described in §A3. 
By contrast, the colour magnitude diagram, as in figures 3 to 13, shows all sources except "noise" detections. Sources with "unreliable 
photometry" (see text §3.2) are marked in blue. Both diagrams arc dominated by the highly reddened Bulge population. The less reddened 
sequences of disk dwarfs and disk red clump giants are also visible in the colour magnitude diagram but less clearly than in figures 3 to 
13. 



2.4 

2.2 

2.0 

1.8 

1.6 

: 1-4 

1.2 

1.0 

0.8 

0.6 

0.4 

0.2 
0.0 

























































































































































>I;iv^^'^^-^■J^■!^ 














■ M 



































































0.2 0.4 



0.6 O.E 
H-K 




Figure 3. {I, fe)=(15,0). This crowded field in the inner Galactic mid-plane shows a well defined dwarf sequence at the left of the colour 
magnitude diagram and a diagnonal red clump giant sequence to the right of it. The two colour diagram shows a distinction between 
giants and dwarfs. Both luminosity classes form sequences parallel to the reddening vector but giants have redder (J-H) colours than the 
dwarf sequence underneath (see text §3.2). The points plotted are selected in the same way as in figure 2. 



limits in uncrowded fields are located at K=18.0, H=18.75, 
J=19.5, with uncertainties of about 0.2 magnitudes. Indi- 
vidual fields in the outer Galaxy should always be within 
0.3 magnitudes of these typical values, since only a modest 
range of observing conditions (seeing and transparency) is 
permitted, as described in the data release papers. We do 
not attempt a more precise quantification of the mode and 
spread of the sensitivity at this stage since there is limited 
data for the outer Galaxy in DR2, and the sensitivity in 
future data releases is likely to change slightly. The seeing 
conditions required to begin an observation at I >107° were 
relaxed from a maximum FWHM of 0.8 arcsec (measured in 



the K bandpass) to 1.0 arcsec from February 2007 in order 
to speed the survey progress. 

It is clear that the survey is much less sensitive near 
the Galactic centre than in uncrowded fields. However the 
90% completeness limits and modal magnitudes are usually 
degraded by no more than 1 magnitude in fields at I >30°, 
even in the mid- plane at 6=0°. 

The effects of extinction on the histograms are apparent 
in the change in the relative numbers of J, H and K band de- 
tections in different parts of the Galaxy. The K band source 
density is the highest in the fields at I <30°. At I >30° 
the H band source density is higher (at least for the fields 
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Figure 4. (i, 6)=(15,1). The red clump giants form a diagonal sequence in the colour magnitude diagram, from mif=10.5 to 12, (J- 
K)=0.fi to 1.7, which then becomes approximately vertical at fainter magnitudes. This field is on a line of sight that gradually rises above 
the Galactic plane at large heliocentric distances. This leads to reduced extinction at large distances (see text §3.2 and figure 15), which 
explain the change in slope of the giant sequence at faint magnitudes. A substantial Bulge giant population is also seen to the right of 
the diagonal red clump giant sequence at rax < 12. The two giant populations appear to merge at fainter magnitudes. The reddening 
sequences in the two colour diagram are shorter than in figure 3. The points plotted are selected in the same way as in figure 2. 



4.0 

3.5 

3.0 

2.5 

2.0 

1.5 

1.0 

0.5 
0.0 





















A(V 


)=10 / 






























le— -7 








Giant 
equence 


























F 


MIIlM 












Dwa 


rf sequenc 






1 











9 
10 
11 
12 
13 
14 
15 

16|-- 

17 

18 
19 



'.%•>.•• -.U -'4.'... 



****** * ♦ * ♦ 



♦*•*** 



'ilire^-'^Ma^^ t VaXV •<•.?: .1".. :••*•»• 



"<•.?**•; If'". *• 



0.5 



1.0 1.5 
H-K 




Figure 5. (/, 6)=(31,0). The field shows very clearly defined dwarf and giant reddening sequences in the two colour diagram. The colours 
of unreddened stars from Hewett et al.(2006) are indicated by the solid lines, with approximate spectral types marked. The reddening 
vector of Rieke & Lebofski (1985) is overplotted. The colour magnitude diagram shows that the Bulge giant population has disappeared 
at this longitude, while the disk red clump giant population follows a winding diagonal sequence, indicating variable reddening along the 
line of sight. The points plotted are selected in the same way as in figure 2. 



at |6| <1° for which multi-band data presently exist). The 
source density in the J band becomes comparable to the H 
band at I >90°. 

3.2 Stellar and extragalactic populations 

The two colour diagrams and colour magnitude diagrams in 
figures 2-13 allow us to clearly identify the main stellar popu- 
lations detected in the GPS and detect changes in extinction 
as a function of distance along different lines of sight. Some 
readers will be familiar with similar diagrams constructed 



from data in the DENIS and 2MASS surveys (e.g. Ojha et 
al.2000) or from surveys of the stellar populations in ex- 
ternal galaxies. In general, the populations detected by the 
GPS consist of less luminous stars than those in most pre- 
viously published wide field infrared surveys. Consequently, 
the diagrams look quite different and their interpretation is 
often less obvious than might be supposed. We have cho- 
sen to plot K vs. (J-K) colour magnitudes since the other 
near infrared colours {(J-H) and (H-K)) provide less sepa- 
ration between the various stellar populations and add little 
to this discussion. In general, near infrared diagrams can 
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Figure 6. (i, fe)=(31,l). The two colour diagram is similar to that for (i, fe)=(15,l). However, the colour magnitude diagram shows a 
diagonal red clump giant branch that does not become vertical at faint magnitudes. This suggests that this line of sight passes through 
a region of the galaxy in which stars are located above the plane, perhaps due to a warp. The points plotted are selected in the same 
way as in figure 2. 
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Figure 7. (i, fe)={55,0). This field shows an abrupt change in the slope of the disk red clump giant branch in the colour magnitude 
diagram at mx=13, indicating the presence of a discrete cloud of dense interstellar gas and dust. The points plotted are selected in the 
same way as in figure 2. 



separate the nearby late dwarf, distant early dwarf, and red 
giant populations, with more complex behaviour depending 
on the sightline. 

In this section we first summarise the general properties 
of the two colour diagrams and then discuss the colour mag- 
nitude diagrams, beginning with the inner Galactic fields. 

The two colour digrams for the inner Galactic fields 
in figures 2 to 8 all show two parallel sequences of stars 
extending from the lower left to the upper right. The up- 
per sequence is composed mainly of giants (mostly of the 
type known as "red clump giants" ) while the lower sequence 
consists of dwarf stars. These long sequences are due to in- 
terstellar reddening: they lie approximately parallel to the 
reddening vector and can be used to estimate the interstel- 



lar extinction toward each star by comparing the observed 
colours with those of unreddened stars. The colours of a 
sample of unreddened main sequence and giant stars from 
Hewett et al.(2006) are plotted in figure 5 for illustration, 
along with the reddening vector of Rieke & Lebofsky (1985). 
Late type giants have ~0.15-0.3 mag larger (J-H) colours 
than dwarfs of the same spectral type, due to the effect of 
their low surface gravities on the opacity of the H~ molec- 
ular ion. The more distant dwarfs also tend to have earlier 
spectral types than the giants, which also contributes to the 
difference in (J-H) colours. 

A careful inspection of the reddened sequences of stars 
in the two colour diagrams shows that they are not quite 
straight lines. This is due to two effects. The first is that 
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Figure 8. (i, fe)=(55,l). This field shows a very strong discontinuity in the slope of the disk red clump giant branch in the colour 
magnitude diagram at mx=13 to 14, where the (J-K) colour increases from 1.2 to 2.4. This indicates the presence of a discrete cloud 
of dense interstellar gas and dust (see text §3.2). This may be related to the weaker feature in figure 7. At fainter magnitudes the giant 
sequence become vertical, indicating that the line of sight has risen above the plane to a region of low interstellar extinction. Note that 
the sequence of late K and M dwarfs (see text §3.2) becomes visible at mif =13 to 14.5, (J-K)=1.0. It protrudes from the bulk of the 
earlier type disk dwarfs at the left hand side and extends up toward the giant branch. The points plotted are selected in the same way 
as in figure 2. 




Figure 9. (Z, 6)=(98.4,0). The field lies in the outer galaxy and the extinction is now clearly less than in figures 2 to 8, as evidenced by 
the shorter reddening sequence in the two colour digram. A 0.5x0.5 degree field is plotted here and for the other outer Galaxy fields in 
order to maintain sufficient source counts to define the populations clearly. The giant branch in the colour magnitiude diagram becomes 
vertical at m/f >13. This indicates a lack of additional extinction at large heliocentric distances in the mid-plane, which is attributed to 
the northern Galactic warp. Note the population of external galaxies that is visible at lower right {(J-K)>2). These are mostly marked 
in blue because they are spatially resolved, and therefore fall into our "unreliable photometry" category, even though their fiuxes in the 
2" aperture are well measured. The points plotted are selected in the same way as in figure 2. 



the nature of the detected stellar population shifts towards 
more luminous stars with increasing distance and redden- 
ing. The clump of stars seen at the lower left of most of the 
two colour diagrams is composed mainly of nearby G, K and 
early M type dwarfs, which have a high space density. As 
we move further along the dwarf sequence the population 
shifts towards early G, F and A type dwarfs, which have 
a lower space density but are detected at larger distances. 



The dwarf sequence ends at a lower reddening than the gi- 
ant sequence because the dwarfs are less luminous. This 
change in populations means that the the dwarf sequence 
has a slightly shallower gradient than the reddening vector. 
The second reason that the gradient is not linear is that 
the effective bandpasses of the filters change with increasing 
reddening when observing a continuum source, so reddening 
vectors become curved as the extinction increases (e.g. Kaas 
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Figure 10. (i, fe)={98.4,l). The diagrams for this field appear very similar to those for (/, fe)=(98.4,0), in term of reddening. This is 
consistent with expectations due to the presence of the northern warp, which creates an overdensity of stars and interstellar extinction 
above the plane at large Galactocentric distances, relative to what would be expected if the outer Galaxy were centred in the plane. The 
points plotted are selected in the same way as in figure 2. 
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Figure 11. (/, fe)=(98.4,-l). This field on a line of sight below the mid-plane shows reduced reddening compared to figures 9 and 10, 
since the overdensity due to the warp is located at positive latitudes. The external galaxy population remains visible at lower right in 
the colour magnitude diagram and the late K and M dwarf dwarf sequence is faintly visible at mj<-=12.6 to 14.5, (J-K)=0.7 , between 
the bulk of the FG dwarf sequence and the giant branch. The points plotted are selected in the same way as in figure 2. 



et al.l999). The precise form of the curve differs for sources 
with different spectra, so there is no universal reddening 
vector, see e.g. Drew et al.(2005). The giant sequence gener- 
ally begins at a slightly higher reddening than the clump of 
nearby late type dwarfs, since giant stars have a lower space 
density. This population can in principle be used to mea- 
sure the reddening curve for red clump giants. In general a 
standard linear interstellar reddening vector is adequate for 
most applications and this approach is used in this work. 
A more sophisticated approach using a model of the instru- 
mental transmission and stellar model atmospheres will be 
developed in a future paper on Galactic structure. 

All of the two colour diagrams show samples selected 
using the least complete, most reliable selection described 



in §A3, with errors <0.05 mag in the colours on each axis, 
in order to distinguish the closely spaced stellar populations. 
Including fainter sources with less precise photometry sim- 
ply increases the scatter in the diagrams, as shown in figure 
Al(a), and extends the dwarf and giant sequences to slightly 
higher reddening values at the upper right. 

By contrast, the colour magnitude diagrams show all 
the three band detections within the chosen axis ranges, ex- 
cept those classified as "noise" detections (see §A3). Sources 
with more reliable photometry are shown in black, while 
sources with less precise photometry are shown in blue. 
Sources are put in the "less precise" selection if they fail 
to satisfy any of the following criteria. (1) Photometric un- 
certainties <0.2 mag in the two bands used in each dia- 
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J-K 

Figure 12. (i, fe)=(171.1,4.75). This field lies on a line of sight almost directly opposite the Galactic centre and 4.75° off the plane. The 
stellar populations are very different from those in figures 2 to 10: the right hand sequence of stars in the colour magnitude diagram is 
now dominated by nearby late K and M dwarfs, as opposed to red clump giants. Giants are more numerous than late K and M dwarfs 
only at m^ < 13, but they are not clearly distinguished from the dwarfs in the colour magnitude diagram. The left hand sequence is 
dominated by F and G dwarfs. The rapid change in (J-K) colour with spectral type among late G and early K dwarfs causes the gap 
between the sequences (see figure 14). The lower panel shows a density map of the colour magnitude diagram (a "Hess diagram"). It 
shows that the two sequences join at faint magnitudes, where the early K types are more numerous and there also is a larger population 
of late K and early M dwarfs. The two colour diagram shows that there is little reddening in this field. The points plotted are selected 
in the same way as in figure 2. 



gram. The corresponding less precise sources are faint ob- 
jects lying in a group at the bottom of each diagram. (2) 
ppErrbits<256. This parameter is decribed in §A3. Most 
sources failing this criterion do so because they are flagged 
as close to saturation, and therefore lie in a group at the top 
of each diagram. Inspection of the diagrams shows that they 
nonetheless often play a valuable role in defining the differ- 
ent stellar populations, indicating that the errors in their 
photometry need not be extreme. (3) pstar>0.9. Sources 
only fail this criterion if they appear significantly non-stellar 
in at least one of the three passbands, usually because they 
are either part of a marginally resolved stellar pair or a re- 
solved external galaxy. Marginally resolved stellar pairs are 
evenly spread among the sources with more reliable pho- 
tometry but extend further to the left and right sides of the 
diagrams. The resolved galaxies are located in a group at 



the lower right of the colour magnitude diagrams. The ex- 
tragalactic group is clearly seen only in the fields at / > 90° . 

The colour magnitude diagrams all show at least two 
principal sequences of stars. On first inspection, one might 
suppose that these correspond to luminosity class V dwarfs 
on the left side of the diagrams and luminosity class III red 
clump giants on the right. This is indeed the case for the 
inner Galactic fields, which we will now discuss. However, 
the situation is more complicated in the outer Galaxy, as we 
shall see later. 

The main feature of the colour magnitude diagrams for 
Z=15, 31, 55 and 98.4 (see figures 3 to 11) is the division 
into two principal sequences corresponding to dwarfs on the 
left and giants on the right. The Besangon model indicates 
that the majority of the giants are K1III-K2III types. These 
stars occupy a sharp and well defined peak in the absolute 
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Figure 13. (i, 6)=(173.65,0.65). The colour magnitude diagram for this field shows some similarity to figure 12, but exhibits greater 
extinction due to the location near the mid-plane. The principal sequences observed remain F and G dwarfs at the left of the diagram and 
late K and M dwarfs slightly to the right (mixed with red clump giants at oik < 14). The population of external galaxies is prominent 
at (J-K)=1.5 to 2.5, mj^ >15. The two colour diagram appears different to that in figure 12 because there is sufficient extinction to 
redden the relatively distant population of F and G dwarfs and move them close to the location of the nearer, less reddened, late K and 
early M dwarfs. This creates a poorly defined mass of stars in which different spectral types are poorly distinguished. The points plotted 
are selected in the same way as in figure 2. 






















■ ij 


































■ ■ -" - 






■ V 
- n 




















■ b 

- n 




















■ a 





■ ■ ^ . 

L-^ L-1 










































■■ ■■■-^^^^ 

















































c oa c.oa d. c o.'i a2D o.2s o.m o.:a d.'-d d.-'s 



Figure 14. Besangon model results with synthetic errors. Upper panels: (i, 6)=(31,0). Lower panels (i, 6)=(171.1,4.75). These models 
reproduce the data fairly well, although our use of synthetic errors based on the archival values has underestimated the true errors at 
bright magnitudes (see §A2), causing the various stellar sequences to appear narrower than in the real data. The small, well defined 
sequence of rather red white dwarfs with mx ~14 in the plots for (/, fe)=(171.1,4.75) is not observed in figure 12 (see text §3.3). The A 
type stars near the origin in the lower left panel are not measured by the GPS because they are saturated. Points with synthetic errors 
>0.2 mag are plotted in blue. 
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Figure 15. Besan§on model K vs. (J-K) diagrams witli no errors. Main sequence stars are sliown in blue, luminosity class III giants 
are shown in black, subgiants (class IV) are shown in green, white dwarfs are shown in red and luminous giants (classes I and II) are 
shown in pink. Left panel: (i, 6)=(31,0), showing a fairly simple division between dwarfs and subgiants on the left and the red clump 
giant branch to the right. Right panel: (i, 6)=(15,1), showing the numerous sub-sequences of giant stars which comprise the red giant 
clump giant branch. The sequence of white dwarfs at the lower right is not observed. 



magnitude function of Class III giants, with absolute magni- 
tude Mx=-1.65 (e.g. Lopez-Corredoira et al.2002), though 
we caution that both flux and colour have some dependence 
on metallicity and age (Salaris & Girardi 2002; Luck et 
al.2007; Gullieuszik et al.2007). These giants form the well 
defined "red clump" in H-R diagrams for globular clusters 
and galaxies toward which the extinction is fairly uniform. 
They are therefore a useful tracer of stellar distance, ex- 
tinction and population density, (see Lopez-Corredoira et 
al.2002) but only in fields in which they can be distinguished 
from dwarfs and other types of giant. 

The dwarf and giant sequences do not always show a 
smooth increase in reddening with distance. The giant se- 
quences in the (Z,6)=(55,l) and (55,0) fields (see figures 
8 and 7) show an abrupt increase in reddening between 
mK=13 and rriK — U- Such features can be explained by 
discrete clouds of interstellar dust. A similar feature occurs 
in the (Z,fe)=(15,0) field (figure 3), where there is a sud- 
den increase in reddening at mx=12.75, (J-K)=2.25. The 
distance to the cloud and the extinction in front of it and 
within it can be simply determined from the known luminos- 
ity and colour of the red clump giants and by adopting an 
extinction law that is appropriate for the diffuse interstellar 
medium (e.g. Rieke & Lebofsky 1985; Cardelli, Clayton & 
Mathis 1989). (Here we neglect the effect of the change in 
effective bandpasses at high extinction that were mentioned 
above). In the case of the field at {I, b) — (55,1) the cloud is 
located at a heliocentric distance of 8 kpc and it introduces 
a visual extinction of Av~7.5 magnitudes. Since the feature 
at mx=13-14 is well populated with red clump giants it is 
likely to be caused by a large cloud of atomic gas rather 
than a relatively small dense molecular cloud. Inspection of 
the "CO data cube from the BU-FCRAO survey (see e.g. 
Shah et al.2004) shows weak but uniform emission across the 
field, with much brighter emission outside the field on both 
sides. The emission is too weak by more than an order of 
magnitude to explain the infrared extinction, which tends to 
support the interpretation of the cloud as an atomic feature 
rather than a molecular one. 



An advantage of the near infared determination of ex- 
tinction (as opposed to optical determinations) is that a 
"universal" extinction law applies to both the diffuse inter- 
stellar medium and molecular clouds (Cardelli et al.l989). 
This is due to the fact that the dust grains responsible for 
extinction have a size much smaller than the wavelength of 
observation for wavelengths A > 1 /im (except in extreme en- 
vironments such as the accretion discs around protostars). 
Small variations in the extinction law can still occur, e.g. due 
to variations in grain composition (and hence refractive in- 
dex) . This may explain why some authors have found values 
of the colour excess ratio E(J-H)/E(H-K) ranging from 1.32 
to 2.08 in some dark clouds (e.g. Moore et al.2005; Racca, 
Gomez & Kenyon 2002), bracketing the value of 1.70 found 
in Rieke & Lebofsky (1985). However, most measurements, 
e.g. Naoi et al.(2006) find that E(J-H)/E(H-K) lies in the 
range 1.6 to 1.7, and they note that some of the reported 
variations are attributable to differences in photometric sys- 
tems. 

Some of the colour magnitude diagrams show an abrupt 
change in the slope of the giant branch for another reason. 
For example, the plots in figures 4 and 9 for {I, b) — (15,1) 
and (98.4,0) show diagonal sequences of red clump giants 
at rriK < 12 and rriK < 13 respectively, and then a change 
to an almost vertical giant sequence at fainter magnitudes. 
This change is due to the red clump giant population leav- 
ing the main body of the Galactic disk at large distances. 
The position of stars along the red clump giant sequence 
is determined mainly by distance. At heliocentric distances 
d >12 kpc, the sequence in the {I, b) — (15,1) plot rises far 
enough above the dusty Galactic Plane for extinction to 
become almost independent of distance. The similar be- 
haviour of the {I, 6) = (98.4,0) plot is attributed to the north- 
ern Galactic warp, which moves the dusty plane away from 
6=0 at large Galactocentric radii. In figure 15 we show a Be- 
sangon model for {I, &) = (15,1) which reproduces and clarifies 
the behaviour at d >12 kpc. We see that the red clump gi- 
ants are split into a number of faint parallel sub-sequences. 
These correspond to the various evolutionary groups: core 
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helium burning horizontal branch giants, fainter hydrogen 
shell burning stars ascending the red giant branch and 
asymptotic giant branch stars with hydrogen and helium 
shell burning (e.g. Robin 1989). Each evolutionary group is 
also split into multiple sequences duo to binning by spec- 
tral sub-type in the Besangon model. Each sequence has a 
slightly different effective temperature and some, e.g. the 
K type red giant branch sequences, have lower luminosi- 
ties. At d >12 kpc the sub-sequences continue to become 
slightly redder at fainter magnitudes (i.e. larger distances) 
since there is a small amount of dust above the Galactic 
plane. However the sub-sequences all end ~2 magnitudes be- 
low the change in slope, as the old disc population declines 
to zero at large distances above the plane. At fainter mag- 
nitudes, new sub-sequences of slightly fainter giants with 
slightly bluer (J-K) colours (which we believe axe stars as- 
cending the red giant branch) come to dominate the giant 
branch. In the GPS data the adjacent sub-sequences merge 
together, with the result that in the vertical part of the giant 
sequence the stars become slightly bluer in (J-K) colour at 
fainter magnitudes. 

The existence of less luminous hydrogen shell burning 
stars in the red clump indicates that the red clump method 
for determining distances, extinction and stellar population 
density should be used with caution at faint magnitudes. 
The Besangon model catalogue displayed in figure 15 for 
(l,b)=(15,l) indicates that at mx ^ 15.5 these less luminous 
giants dominate the red clump. 

The field neax the Galactic Centre at (/, 6)=(-0.3,0) has 
a rather different colour magnitude diagram from the other 
fields. Three principal sequences arc apparent in figure 2. 
The largest population is giants in the Galactic bulge, with 
3 <(J-K)< 1. According to the Besangon model these have a 
wide range of spectral types and metallicities. However most 
suffer similar reddening, since they are all at distances of 7- 
10 kpc (according to the model). The other populations are 
the nearby disc dwarfs at the left side of the diagram and the 
disc red clump giants, which form a diagonal sequence with 
{ J-K) colours intermediate between the other two groups. 

Moving to the outer Galactic fields, we consider first 
the diagrams in figure 12, for (?, b) = (171.1, 4.75). This 
shows a field with negligble reddening, which can be dis- 
cerned from the corresponding two colour diagram. Two 
principal sequences of stars are apparent in the correspond- 
ing colour magnitude diagram, mostly with reliable pho- 
tometry. A more diffuse clump of sources with "less reli- 
able" photometry is situated near (J-K)'^\.'&^ mK=17. This 
clump is composed of external galaxies (mergedClass=+l , 
pstar<C 0.9), and the corresponding luminosity function 
increases down to the completeness limit (see §4.5). Inspec- 
tion of the Besangon model catalogue for these coordinates 
(plotted in the lower panels of figure 14) indicates that the 
overwhelming majority of the sources in both of the two 
principal sequences arc actually luminosity class V dwarfs. 
The left hand sequence is composed of F and G type dwarfs 
(and some subgiants of luminosity class IV) . The right hand 
sequence is composed mainly of M and late K type dwarfs, 
with only a small proportion of giant stars. Luminosity class 
III giants are restricted to the upper portion of the right 
hand sequence with rriK < 13.5. The dwarf component of this 
sequence begins at ttik R^12 and extends down to the faintest 
sources at the bottom of the diagram. The overlapping dwarf 



and giant populations in the right hand sequence are not 
distinguishable in the magnitude range 12< vtik <13.5. The 
two colour diagram also fails to separate them, although it 
should in principle reveal any large population of early K- 
type red clump giants as an overdensity near ('J-J?)=0.57, 
(H-K)=Q.V1. 

The reason that two sequences of dwarf stars are de- 
tected at (i, 6)=(171.1, 4.75) is not immediately obvious. In- 
spection of the density map of the colour magnitude diagram 
shown in the lower panel of figure 12 (a "Hess diagram") 
and the Besangon model shows that there is a connection 
between the two sequences at 17< vcik <18. However, the 
M type dwarfs, which arc the most numerous population 
in the diagram, extend above and below this connection. 
Nearby late K and early M-type dwarfs have a narrow range 
of colours ((^J-i^TjwO.Q) and they are sufficiently numerous to 
define the right hand dwarf branch at 12< vcik <17. Simi- 
larly the distant F and early G-type dwarfs observed in this 
magnitude range also have a narrow range of colours. By 
contrast the late G and early K type dwarfs arc spread over 
a wider range of (J-K) colours due to the steeper colour- 
temperature relation for these types. Their numbers are sim- 
ilar to the earlier type dwarfs in this magnitude range, so the 
rapid change in colour with temperature creates the sparsely 
populated gap between the two branches and the bimodal 
colour distribution. At mK>17 the late G and early K type 
dwarfs are sufficiently numerous to fill the gap. 

The two sequences of dwarfs arc present in all the 
colour magnitude diagrams plotted but they are less ob- 
vious at I < 100°. This is due to a combination of factors: 
(i) detectable F and G type dwarfs greatly outnumber M 
dwarfs near the mid-plane at I < 100°; (ii) there is a trend 
of increasing reddening with decreasing Galactic longitude, 
which causes the distant highly reddened F and G dwarf 
sequence to merge with K and M dwarf sequence; (iii) the 
population of distant red clump giants is much larger and 
produces a more obvious second sequence of stars to the 
right of the dwarf sequence that is clearly separated from 
the common F, G, K and M type dwarfs by high reddening. 
The K and M type dwarf branch is still detectable in some 
of the colour magnitude diagrams for 1=98.4 and 1=55 (see 
figures 8, 9, 10 and 11) appearing as a faint protuberance 
above the mass of dwarfs at rriK w 14. 

The clump of external galaxies seen in the colour magni- 
tude diagrams for the I > 90° fields is not detected at smaller 
longitudes, owing to the combination of higher extinction 
through the plane and the reduced sensitivity caused by 
source confusion. 

The two colour diagrams show clearly the trend of in- 
creasing extinction with decreasing Galactic longitude and 
latitude. The dependence on Galactic latitude is apparent 
even in the outer Galaxy, where extinction is relatively low. 
In the (Z,6)=(171.1, 4.75) field (see figure 12) the densest 
cluster of sources near (J-H)=QA is composed of F and 
G type dwarfs. In the (/, 6) = (173.65,0.65) field (see figure 
13), we see that the F and G types have moved up to (J- 
H)=0.5, owing to the significant extinction present in the 
mid-plane toward these relatively distant sources (1 to 6 
kpc). By contrast, the K and M type dwarfs with (J-H)=0.5 
to 0.7 lie in the same place in both diagrams, owing to the 
much lower extinction toward these nearer and intrinsically 
fainter sources (<3 kpc distance). The effect of the northern 
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Galactic warp is apparent in the two colour diagrams for 
1=98.4. The extinction at (Z, 6) = (98.4,1) is similar to that at 
(/,6)=(98.4,0) and clearly greater than that at (/, 6)=(98.4,- 
1), as shown in figures 9, 10 and 11. 

3.3 Further comparison with the Besangon 
Models 

The Besangon stellar population model has been used to aid 
the interpretion of the GPS data discussed above. In figures 
14 and 15 we show some of the diagrams produced by the 
model in order to illustrate the quality and the limitations 
of such comparisons. The results in figure 14 have synthetic 
errors included in the simulation. The error curve used was 
based on a fit to the errors as a function of magnitude given 
in the WSA for each passband in the appropriate field. As 
noted in §A2, this will tend to underestimate the true errors 
(particularly for bright stars), but it is adequate for a simple 
comparison. 

The visual extinction per kiloparsec in the diffuse inter- 
stellar medium (ISM) is a free parameter in the Besangon 
models. Discrete clouds may also be added to model strong 
extinction features, as noted earlier. The ISM extinction 
parameter was adjusted by trial and error until the best 
combined match by eye to the observed luminosity func- 
tion and the K vs. ( J-K) colour magnitude diagram was pro- 
duced. The derived extinction was Av=2.25 mag/kpc for 
the (Z,6)=(31,0) simulation and Av=0.7 mag/kpc for the 
{1, 6)=(171.1,4.75) simulation. The models use the extinction 
law of Mathis (1990), assuming Rv=3.1, so the equivalent 
K band extinctions are Ak=0.242 mag/kpc and A_r-=0.075 
mag/kpc. The default value of Av=0.7 mag/kpc is appro- 
priate for the diffuse ISM and will usually be correct in fields 
with low extinction. Higher values are required to fit fields 
with high extinction. In reality high extinction is expected to 
be caused by numerous discrete molecular clouds, but often 
the individual clouds have too small an effect to be resolved 
in the data. We note that some authors quote a slightly 
higher value for the extinction in the diffuse ISM, e.g. Whit- 
tet (1990) quotes 0.8 mag/kpc. This difference is likely to be 
due to the choice of distance scale in the Besangon models, 
which places the Galactic centre at d=8.5 kpc from the sun. 
The distance to the supermassive black hole at the Galac- 
tic Centre has been reliably determined by matching the 
observed angular size of stellar orbits to the corresponding 
stellar radial velocities: d=7.62±0.32 kpc (Eisenhauer et al. 
2005). Hence some users may prefer to scale the Besangon 
distances and extinction values accordingly. 

The two colour diagrams in figure 14 for (/,6)=(31,0) 
and (171.1,4.75) are a good quantitative match to the 
data. The blue end of the model dwarf sequence for 
(Z, 6)=(171.1,4.75) appears sharper than the observed se- 
quence in figure 12, since the photometric errors of these hot 
and bright stars are underestimated in the model. The other 
noticeable discrepancies in the (171.1,4.75) plot are: (i) the 
appearance of a cluster of sources at the upper right of the 
diagram {(H-K)=Q.A'a^ (J-H)=0.85), which is not observed; 
and (ii) the sparesely populated sequence of stars extending 
off the top of the diagram at (H-K)=0.2-0.25. The first group 
represents K type white dwarf stars in the model, which all 
have very similar masses and spectral types. The data sug- 
gests that the real white dwarf population has less uniform 



parameters than in the model. The second group represents 
late K-type giants in the model. These are not included in 
figure 12 because they are too bright (i.e. heavily saturated 
sources). The two colour diagram for the (/, 6)=(31,0) model 
shows a similar small population of luminous giants at (H- 
K)=2.5-3.0 (late M-types in that case) that is not seen in 
figure 5 for the same reason. Such bright populations are 
better studied with the 2MASS or DENIS surveys. 

The colour magnitude diagram for the (Z,6) = (31,0) 
model is in close agreement with the data in figure 5. The 
only difference is that slope of the red clump giant sequence 
at the upper right is more constant in the model than in real- 
ity. This reflects the patchy nature of interstellar extinction. 
The colour magrntude diagram for the (Z, 6) = (171.1,4.75) 
model is also in fair agreement with the corresponding data 
in figure 12. The following differences are obvious however. 
(1) The external galaxies at the bottom right of the plot 
in figure 12 arc not in the Besangon model. (2) The two 
sequences of dwarf stars discussed earlier appear as three 
sequences at 0.4:5<(J-K)<0.9 in the model. The sparsely 
populated middle sequence at (J-K)=0.65 corresponds to 
early K dwarfs in the model. This may well be a real fea- 
ture but it is not apparent in figure 12 due to the larger 
photometric errors in the real data. (3) The short vertical 
sequence of red sources at (J-K)=1.25 seen in the model are 
the K-type white dwarfs; these are not seen in the data, as 
noted earlier. 

In figure 15 we show Besangon synthetic colour mag- 
nitude diagrams with no errors added, for (Z,6)=(31,0) and 
(15,1). These illustrate the discrete binning of sources in the 
Besangon model grid. The synthetic plots for {1, 6)=(31,0) 
appear very similar in figures 14 and 15 since distance and 
interstellar reddening introduce numerous degeneracies in 
magnitude and colour between different spectral types. In 
contrast, the plot for (Z,6)=(15,l) clearly shows the numer- 
ous red clump giant sub-sequences (see discussion in §3.2). 
These appear well defined in the model because each sub- 
type of giant star has a unique luminosity, effective temper- 
ature and colour in the model and there is no additional 
extinction to merge the sub-sequences in the dust free re- 
gion above the Galactic plane. Some of the numerous sub- 
sequences correspond to the various evolutionary stages on 
the HR diagram, as noted earlier. In addition, the effect of 
binning by spectral sub-type in the model creates an arti- 
ficial division which further multiplies the number of sub- 
sequences. We note that the more luminous giants above 
the most populated part of the red giant clump appear to 
be more numerous in the data than the model. 



4 DEMONSTRATION SCIENCE 

4.1 Identification of Young Stellar Objects with 
the UKIDSS GPS and the Spitzer GLIMPSE 

survey 

One of the principal science goals of the GPS is to detect and 
characterise large numbers of star formation regions, in or- 
der to examine the effect of cnviromncnt on star formation. 
Young Stellar Objects (YSOs) can be identified and classi- 
fied by the excess emission from hot dust around the central 
protostar at wavelengths A > 2 //m. The GPS can detect this 
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Figure 16. The synergy of UKIDSS-GPS and Spitzer-GhlMPSE data, (top) K band image of the central parts of a star formation region 
in the mid-plane: G28. 983-0. 603. Sources detected by GLIMPSE at both 3.6 and 4.5 /im are marked with white triangles. A 0.5 arcsec 
matching radius was used. Sources identified as Class I or Class II YSOs in the lower right panel are also marked with red squares in 
the image. The YSOs show a slight concentration towards the centre of the cluster. The lower panels show how the combination of near 
IR data from the GPS with mid-IR data from the Spitzer GLIMPSE legacy survey can be used to efficiently identify and classify YSOs 
by their K-4.5 ^m excesses. JHK detections are plotted as filled circles in the lower right panel, while sources undetected in the J band 
are plotted as open circles. The near infrared reddening vector of Rieke & Lebofsky (1985) is plotted in the lower left panel. The lines 
in the lower right panel that divide the YSO classes are parallel to the reddening vector of Flaherty et al.(2007). 



excess in some YSOs by using the ( J-H) ys.(H-K) two colour 
diagram. However, the excess is larger and therefore more 
easily measured at mid-IR wavelengths. Mid-IR data from 
the Spitzer Space Telescope alone can be used to identify 
and classify YSOs (e.g. Whitney et al.2004; Churchwell et 
al.2004; Allen et al.2004; Megeath et al.2004). However, this 



requires a detection in three or four Spiteer-IRAC passbands 
and the relatively poor sensitivity and spatial resolution of 
the longer wavelength passbands at 5.8 and 8.0 limits 
the number of YSOs that can be classified. There is there- 
fore a natural synergy between the GPS and the mid-IR 
Spitzer GLIMPSE legacy survey of the mid-plane (Benjamin 
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et al.2003), particularly since the 2MASS survey generally 
has insufficient depth to adequately match GLIMPSE in star 
formation regions. 

This is illustrated in figure 16 for the young cluster 
G28. 983-0. 603, which is coincident with an HII region ob- 
served by Lockman (1989). This cluster was identified by 
Bica et al.(2003) using 2MASS data and is listed as cluster 
10 in their catalogue. A fair number of YSOs are identified 
via their K band excesses in the near IR two colour diagram 
(lower left panel), lying to the right of the principal sequence 
of reddened field stars and the more evolved "discless" Class 
in YSOs. However, the median (H-K) colour excess of YSOs 
is only ~ 0.2 magnitudes (e.g. Wilking, Greene & Meyer 
1999), so it is hard to distinguish cluster members with mod- 
est K band excesses from field stars. The lower right hand 
panel shows that the YSOs are far more clearly identified by 
using the (K-4.5) colour (many showing excesses >0.5 mag- 
nitudes) where the 4.5 /im flux comes from Spitzer/YRAC 
band 2. The horizontal axis in the lower right panel gives 
the approximate visual extinction, estimated from the (H- 
K) colour. In the lower right panel we plot the 1345 sources 
within a 6x6 arcminute box centred on the cluster that are 
detected in all of the J, H , K, 3.6 and 4.5 jiin bands as filled 
circles. Sources which lack a J band detection are plotted as 
open circles. A 0.5 arcsec matching radius was used to com- 
bine the GPS and GLIMPSE data, since the default IPAC 
matching radius of 1.2 arcsec was found to create numerous 
false matches in this crowded field. We note that there arc 
only 128 four band 5piteer-IRAC detections in this field, so 
the near IR/mid-IR combination increases the sample size 
by an order of magnitude. There is also an order of magni- 
tude increase in the sample size for the YSOs. 

The sources in the lower right panel of figure 16 are di- 
vided into Class I YSOs, Class II YSOs and Class Ill/field 
stars according to their K-A.^ /xm excess by straight lines 
that are parallel to the reddening vector. The Class Ill/ficld 
stars form a contirmous sequence parallel to the reddening 
vector but show an abrupt increase in the source density 
at Av ^ 12. Sources with a (K-A.Z) excess are also located 
mostly at Av > 12 (typically at 12 < Av ^ 30) so we can 
infer that the cluster lies behind 12 magnitudes of visual 
extinction and that there is significant extinction within the 
cluster. Thus the near IR data allow us to identify stars 
with Av ^ 12 as the most likely cluster members. Lockman 
(1989) measured a radial velocity of 52.6±1.3 km/s for the 
HII region. Adopting a Galactic rotation model with a rota- 
tion speed of 220 km/s and 7.62 kpc as the distance to the 
Galactic centre (Eisenhauer et al.2005) we derive a kine- 
matic distance of 3.1 kpc or 10.2 kpc. Inspection of the K 
vs. (J-K) colour magnitude diagram for stars in the same 
WFCAM field indicates a preference for the closer distance. 
This is because of stars in the red giant branch with Avwl2 
have m/f =12-14, corresponding to distances of 2.9-7.3 kpc. 
The large spread is due to highly variable extinction across 
the field. The cluster lies in a extensive region of high ex- 
tinction oriented east-west across the field, so mx~12 and 
d~3 kpc is a reasonable distance to derive using the red gi- 
ant branch and this is consistent with the nearer kinematic 
distance. The GPS data are fully complete to mK~16.0 in 
this crowded field and are ~50% complete to mK=17.25. 
At d=3.1 kpc, m/f =17.25 corresponds to very low mass pre- 
main sequence stars with masses as low as M~0.095 Mq (us- 



ing the NextGen isochrone of Baraffe et al.l998 for an age of 
1 Myr) in the absence of extinction. Such sources would be 
detected if they were ejected from the region of high extinc- 
tion in which most of the cluster lies. For cluster members 
with a more typical extinction of Av='21 (A/.s:=2.35), the 
50% completeness limit corresponds to low mass stars with 
M~0.5 Mq. 

The locations of the dividing lines were determined us- 
ing information from Gutcrmutli (2005) in which Class I, 
Class II and Class III YSOs (with classifications determined 
from 4 band IRAC detections) were plotted in the dered- 
dened (if -3. 6) vs. (3.6-4.5) two colour diagram. Gutermuth 
(2005) found that the different YSO classes were easily lo- 
cated in that diagram since the colours correlated excel- 
lently with the colours in the (3.6-4.5) vs.(5. 8-8.0) IRAC two 
colour diagram. The near IR and mid-IR passbands with the 
best sensitivity to reddened young stars are H, K, 3.6 /xm 
and 4.5 fiin. Gutermuth (2005) therefore devised a sophis- 
ticated scheme to classify the numerous YSOs detected in 
only these 4 passbands. To briefiy summarise the method, he 
dereddened sources with J band detections to the classical 
T Tauri locus of Meyer, Calvet & Hillenbrand in the (H-K) 
vs. ( J-H) diagram and thereby determined the individual ex- 
tinction values. He used that information to plot YSOs in a 
dereddened (H-K) vs. (3.6-4.5) two colour diagram and find 
the the best fit locus of YSO colours. The larger sample of 
sources which lack J band detections were then dereddened 
to this locus in the observed (H-K) vs. (3.6-4.5) diagram, 
taking advantage of the fact that the locus is incUned to the 
reddening vector by a large angle in that diagram. Finally, 
he used the extinction values for the full sample to classify 
YSO candidates using their location in the dereddened {K- 
3.6) vs. (3.6-4.5) diagram. This diagram was found to be the 
best diagram for clasifying YSOs, owing to the significant 
infrared excesses of protostars on both axes. 

In the lower right panel of figure 16 we have used a 
scheme which is simpler in some ways but is more effec- 
tive in crowded GPS/GLIMPSE fields. We find that the 
dereddened {K-3.6) vs. (3.6-4.5) diagram does not readily 
detect the YSOs in this field due to the far larger num- 
ber of field stars and the large uncertainties in the 3.6 and 
4.5 /xm fluxes, which cause many stars to scatter into the re- 
gion occupied by YSOs. These large uncertainties are due to 
the fact that the GLIMPSE legacy survey is shallower than 
most Spitzer GO observing programmes: it was designed to 
provide quite literally a glimpse of a very large area of the 
Galactic plane. In comparison the GO programmes also usu- 
ally have fewer field stars contaminating the sample since 
the nearby star formation regions in the Gould belt often lie 
off the Galactic plane. We use a simpler formula for extinc- 
tion: {Av = {H - K - 0.2)/0.063), which is based on the 
extinction law of Rieke & Lebofsky (1985), with (H-K)=0.2 
adopted as the typical intrinsic colour of most stars. This 
formula docs not take account of the fact that YSOs will 
have an (H-K) excess due to hot circumstellar dust. How- 
ever, by avoiding use of the GLIMPSE fluxes in calculating 
extinction we greatly reduce the scatter in the lower right 
panel of figure 16. Like Gutermuth, we use the extinction 
law of Flaherty ct al.(2007) to determine the extinction in 
the mid-IR bandpasses, relative to the K band. Flaherty et 
al.(2007) report a possible difference in the extinction law 
for dark clouds and for the interstellar medium, the latter 
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being calculated by reanalysing the data of Indebetouw ct 
al.(2005). The diflFerence could increase the gradient of the 
dividing lines in the lower right panel of figure 16 by 5%, 
leading to a different classification for a small number of 
sources. Wc note that the common method of dorodden- 
ing to the Classical T Tauri star (CTTS) locus of Meyer et 
al.(1997) runs into difficulties in GPS datasets containing a 
variety of dwaxf and giant field stars that do not lie on the 
locus. In any case, the CTTS locus requires fluxes in the J 
bandpass, which arc often unavailable. 

We therefore recommend the use of diagrams similar to 
the lower right panel of figure 16 for simple classification of 
YSO candidates in GPS data. The extinction axis helps to 
identify stars that arc likely to lie in front of or behind the 
cluster and the (if -4. 5) colour contains most of the informa- 
tion in the dereddened (if-3.6) vs. (3.6-4.5) diagram. The 
use of the (H-K) colour to determine extinction will lead to 
a systematic overestimate of the extinction towards YSOs, 
but a typical (H-K) excess of 0.2 magnitudes will lead an 
overestimate of only 3 magnitudes in the extinction, which 
is small in the context of figure 16. After the likely YSOs 
have been identified this bias can be allowed for. 

Wc caution that there arc limits outside which these 
YSO clasifications become unreliable. Gutermuth (2005 and 
private comm.) noted that his classification system was cal- 
ibrated using low mass protostars. It is not calibrated for 
more massive YSOs (Herbig Ae/Be stars) or for proto-brown 
dwarfs, which the Class I/II/III system of Lada et al.(1984) 
was not designed for. Herbig Ae/Be stars are likely to be the 
most numerous types of YSO detected in distant GPS clus- 
ters so further work will be needed to devise a classification 
scheme for them. 

4.2 GPS data in nebulous stcir formation regions: 
M17 

The well known high mass star formation region M17 was 
included in the UKIDSS Science Verification programme un- 
dertaken in April-May 2005, before the start of survey op- 
erations. A single WFCAM tile was observed in the three 
near IR passbands, using the same total exposure times as 
the GPS. The central portion of the K band image is shown 
in figure 18. It was noted in Lawrence et al.(2007) (see also 
§A2) that the WSA fails to include almost all the stars that 
are apparent in the regions of brightest nebulosity. This is 
due to a limitation of the source detection algorithm in the 
CASU reduction pipeline that is planned to be fixed from 
DR5 onward. Here we show the results of our own crowded 
field profile fitting photometry of the images from the WSA 
with IRAF/DAOPHOT and a comparison with the pub- 
lished data of Jiang et al.(2002), which covered a much 
smaller part of the same field. Only the central 4 arrays out 
of the 16 in the WFCAM tile contain bright nebulosity, and 
then only in a minority of each array. We used a 5(7 thresh- 
old for the source detection with DAOFIND for each array 
in each filter. Visual inspection showed that this produced 
very few spurious detections. Imago profiles were generated 
using ~15 isolated stars in each array, assuming a spatially 
independent profile. The photometry was calibrated by com- 
parison with the WSA catalogue for each array, using the 
median zero point correction for the sources common to both 
catalogues. Finally, the photometry for these four arrays was 



band merged in TOPCAT with a 0.5 arcscc pair matching 
radius and then merged with the WSA catalogue for the 
other 12 arrays in the tile. A total of 788841 sources were 
detected in the GPS field. 

In figure 17(a-b) wc show the linear and cumulative K- 
band luminosity functions (LFs) of regions IC, IN, IS and 
2C defined by Jiang et al. (2002) (these are illustrated in the 
right hand panel of figure 18.) While the overall LFs exhibit 
similar characteristics to those in figure 7 of Jiang et al., 
figure 17(c) shows quantitatively to what extent both sets of 
LFs are in agreement. In general, the WFCAM-derived LFs 
contain more objects at fainter K-hwd magnitudes than 
those of Jiang ct al., and reach a similar limiting magnitude. 
The number of sources detected in each region in the GPS 
data is (1047, 7402, 3658 and 1711) for (IC, IN, IS and 2C) 
respectively. This compares with figures of (528, 2584, 1588 
and 1367) for the same regions in Jiang et al.(2002). The 
excess of objects in our LFs is most likely duo to the superior 
spatial resolution of UKIRT, relative to the set up used by 
Jiang et al. Our WFCAM observations are characterised by 
about 0.8 arcscc FWHM and have 0.2 arcsec pixel sampling 
after microstepping. This compares very favourably with the 
0.45 arcsec pixels of the SIRIUS camera on the Infrared 
Survey Facility used by Jiang et al., which is located at a site 
in South Africa with a typical seeing of 1.5 arcsec FWHM. 
This is supported by figure 17(c): region IC exhibits the 
highest stellar density, and shows a systematically greater 
number of objects in our LFs compared to those of Jiang 
et al., for K>^12 mag down to the detection limit of both 
sets of observations. Regions IN and IS, coinciding with the 
northern and southern nebular regions near the core of M17, 
also have fairly high-density subregions. As such the excess 
of fainter sources {K^ 14 mag) compared to Jiang et al. - 
in particular in region IN - is not surprising if we assume 
that the difference in spatial resolution is the driving force 
behind this behaviour. 

An effect of the larger number of detections in the 
UKIDSS data is to shift the peak of the LFs in the crowded 
regions IC and IS to fainter magnitudes than in Jiang 
et al.(2002). In that publication the authors were unsure 
whether the different locations of the peaks in the different 
regions were due to an actual property of the regions or to 
issues of depth and spatial resolution. It now seems clear 
that the differences are caused by spatial resolution effects, 
and still deeper observations with even higher resolution will 
be required to search for any genuine differences. 

In figure 18 we show an extinction map of the region. 
This was generated by calculating the extinction towards 
each individual star and averaging the results in 15 arcsec- 
ond bins, then gaussian smoothing the results with a FWHM 
of 1.5 bins. The region of high extinction clearly follows the 
"Omega" shape of M17 (also known as the Omega Nebula). 
In regions of very high extinction the extinction derived by 
this method is low, evinced by the white regions with the 
darkest parts of the map. This is because the densest part of 
the molecular cloud obscures all the background stars that 
dominate the star counts in most other parts of the map, 
so only foreground stars with low extinction are detected. 
The extinction towards individual stars was derived from the 
measured H and K fiuxes using the simple formula A.v={H- 
/!'-0.2)/0.063, as in §4.1. Comparison with the K band image 
in the right hand panel of figure 18 shows that the Q, shape of 
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Figure 17. K band Luminosity Functions for M17 in the regions "IC", "IN", "IS" and 2C defined by Jiang et al.(2002). The top left 
panel shows the LFs, the top right panel shows the cumulative source counts and the bottom panel compares the source density with 
that measured in the same regions by Jiang et al.(2002). 
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Figure 18. (left) Extinction map of the M17 field, illustrating the "Omega" shape of the nebula. The greyscale runs from Ay=0 (white) 
to Av=26 (black). The extinction for each star was derived from the (H-K) colour and the results were averaged in 15 arcsecond bins 
(see text §4.2). The white regions surrounded by black are places with very high extinction where only foreground stars are detected. 
(right) K band image of the central portion of the field, illustrating the location of the regions used in figure 17. 
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Figure 19. A three colour JKH2 image centred on tlie L1688 dark cloud in p Ophiuchi. Red patches show H2 emission where the 
outflows impact the denser environment, creating compressed layers of shocked emission revealed in the 2.12 pro filter. Blue and green 
colours correspond to emission in the J and K bands respectively. The image size is 31' 40" X 18' 44". The insert images show close-ups 
of several of the H2 shocks (2.12 /^m images with no continuum subtraction) with the following identifications from Gomez et al.{2003) 
(labelled G) and Khanzadyan et al.(2004) (labelled f). Insert A: New feature; B: G14; C: G8a; D: G11,G12,G13, HH313; E: G20, flO-04; 
F: G24, HH79a; G: G3, f09-01; H: G25, HH711; I: f08-01a 



high extinction (i.e. the Giant Molecular Cloud) follows the 
outer rim of the HII region, which is ionised by the cluster 
located in region IC. Regions with high extinction located 
further north and west of the HII region are seen as dark 
areas in the K band image. 

A full analysis of the M17 data will be presented in a 
future paper. 



4.3 H2 emission in the p Ophiuchi dark cloud 

Figure 19 shows results of WFCAM H2 imaging of the 
p Ophiuchi dark cloud. These data were taken as part of 
UKIDSS GPS Science Verification in April 2005 with the 
same integration times as the TAP survey (see §2), covering 
a 52x52 arcminute field. The p Ophiuchi cloud has been the 
focus of many programmes to understand the formation of 
low-mass stars. While it is difficult to detect the youngest 



YSOs (Class protostars) because their near IR radiation 
is obscured by the material from which they form, prodi- 
gious outflowing streams of molecules are often visible at 
near infrared wavelengths 

The WFCAM data have uncovered a number of new 
parts of these flows. Whereas previous surveys attributed the 
driving sources to be within a few arcminutes, the wide field 
makes it possible to piece together outfiows which may ex- 
tend over 30 arcminutes corresponding to parsec scale flows. 
Two such outflows are indicated in figure 19. The data sug- 
gest that much of what we see spread out on the largest scale 
in p Ophiuchi is actually driven from a single active clump. 
One can then determine the true number of outfiows and 
determine their full extent. With the aid of future epochs 
of observation it will be possible measure their velocity and 
hence estimate their ages and calculate the feedback of me- 
chanical energy into the cloud. Similar parsec scale fiows 
associated with low mass star formation have been identi- 
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Figure 20. Colour magnitude diagrams (CMDs) of the candidate counterparts to the Chandra X-ray sources {coloured points) overlaid 
on a randomly selected sample of the Nuclear Bulge stellar population (small black points) from the UKIDSS DR2 and 2MASS (used for 
sources too bright for UKIDSS DR,2 where saturation results in inaccurate photometry). A sharp decline in source density is seen above 
the thresholds at mjj<12.75 and mi<-<12.0, owing to the lower source density in 2MASS. All sources are selected from the 2° X 0.8° 
region centred on Sgr A* of the Chandra Nuclear Bulge X-ray survey. It is clear that the candidate counterparts trace the overall stellar 
population. The CMD containing J band data has a higher proportion of local dwarf candidate counterparts, indicating this could be 
used to identify more local X-ray sources. The X-ray source candidate counterparts are coloured representing the Chandra hard colour 
(colour bars representing the colour association, see text §4.4 for details of Chandra hard colour) showing that the local dwarf candidate 
counterparts are more commonly associated with X-ray sources with low values of colour, indicating soft spectra, low absorption or both. 
X-ray source counterparts plotted in in black have no net counts in one of the two energy bands (see Muno et al., 2006) 



fied in several other nearby molecular clouds, (e.g. Bally & 
Devine 1994; Devine 1997). 



In figure 19 the red patches mainly correspond to the 
locations where the outflows impact with the environment, 
creating compressed layers of shocked H2 revealed by the 
2.12 /im filter. Blue and green correspond to J band and K 
band respectively. Despite their large spatial extent, these 
flows have dynamical lifetimes that are only of order 10* 
years, assuming typical flow velocities of 100-200 km/s. Since 
the outflows appear to be driven from a single clump this 
suggests that only a few protostars are driving them. The 
fact that most of the pre-main sequence stars in the cloud 
do not seem to be generating detectable molecular outflows 
suggests that the active stage of protostellar accretion is ex- 
tremely short, presumably of the same order as the dynami- 
cal timescale inferred from the extent of the jets detected in 
these data. This argument assumes that most or all proto- 
stars drive molecular jets when undergoing rapid accretion. 



4.4 X-ray sources in the Nuclear Bulge 

We have performed matching of the UKIDSS DR2 
gpsSource catalogue to the ~4200 Chandra X-ray sources 
of the Muno et al.(2003; 2006) catalogues to search for 
the candidate counterparts to the X-ray sources detected 
on sight lines in the vicinity of the Galactic centre. The 
X-ray data covers an approximately rectangular 1.6 deg^ 
area at -1<Z<1, -0.4<6<0.4. There are some gaps in the 
UKIDSS DR2 area coverage in this region, so the area of 
the cross match is 1.21 deg^. The area contains 2.0x10® in- 
frared sources with an average spatial separation of 2.8". 
As shown in Bandyopadhyay et al.(2005), to identify can- 
didate counterparts to X-ray sources in the Nuclear Bulge, 
very high resolutions are required to avoid issues of confu- 
sion due to the high stellar density. The UKIDSS survey is 
the flrst of its type to have sufficient resolution, depth and 
area coverage of the Nuclear Bulge for such a task to be un- 
dertaken. It has sufficient depth to detect giant stars of all 
types in the Nuclear Bulge as well as a significant fraction 
of all main sequence stars to A or F type. By contrast, the 
2MASS survey detects only M giants and O-B type main 
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X-ray error size Number of candidate counterparts 



(arcsec) 


1 


2 


3 


> 4 


Total 


0" - 1" 


523 


110 


49 


27 


709 


1" - 2" 


600 


153 


58 


43 


854 


2" - 3" 


245 


78 


44 


33 


400 


0" - 3" 


1368 


341 


151 


103 


1963 



Table 1. The numbers of candidate counterparts to the Muno et al.(2006) catalogue of X-ray sources from the Wang et al.(2002) 
Chandra observation of the Nuclear Bulge. The number of counterparts has been broken down in to X-ray sources with positional errors 
of 0" - 1", 1" - 2", 2" - 3" and the total numbers on the bottom row. 



sequence stars in the Nuclear Bulge (Cox 2000). Even more 
importantly, the spatial resolution of UKIDSS is adequate to 
identify the correct candidate counterpart to a large fraction 
of the X-ray sources, whereas the 2" resolution of 2MASS 
results in severe source confusion and a much greater likeli- 
hood of mis-matching. 

Numerous studies have tried to characterise these newly 
discovered X-ray sources in the Bulge based on their X-ray 
properties. Pfahl et al.(2002) suggest that a large fraction 
may be wind- accreting neutron stars with high mass com- 
panions. Muno et al.(2003; 2006) and Ruiter et al.(2006) 
propose that they could be white dwarfs accreting from main 
sequence counterparts (cataclysmic variables, polars and in- 
termediate polars). Willems & Kolb (2003) and Liu & Li 
(2006) believe them to be neutron stars with low mass com- 
panions and Wu et al.(2007) have speculated that they could 
be isolated neutron stars and black holes accreting from the 
interstellar medium. Without positive identification of the 
stellar counterparts to these X-ray sources we will not be 
able to differentiate between all these possibilities. The ex- 
tremely high, variable extinction towards the Nuclear Bulge 
requires these observations to be performed in the infrared. 
As mentioned above, the UKIDSS GPS is the first infrared 
survey to have the resolution to allow the identification of 
candidate counterparts to the whole population of X-ray 
sources and so will be a vital tool in the understanding of 
this population. 

We used TOPCAT to identify candidate counterparts 
to the X-ray sources. The stated positional accuracy of 
UKIDSS is ~0.1" as matched to the 2MASS survey. We 
used a larger 0.3" positional error for the UKIDSS DR2 
source positions to account for the possible confusion in the 
Nuclear Bulge caused by the very high stellar density. The 
"out-of-the-box" Chandra positional error is 1", though this 
can be considerably larger (up to 9") in some cases due to 
image distortion in the outer regions of the Chandra array 
and/or to the very low S/N of the faintest sources detected. 
For the purpose of this study, we discarded all X-ray sources 
for which the X-ray positional errors were greater than 3" 
radius; for error circles greater than this size, the number 
of candidate IR counterparts is sufficiently high (5-30 stars) 
to render meaningless any attempt at astrometric matching 
between the IR and X-ray catalogs. We used the 0.3" po- 
sitional error for the UKIDSS DR2 source positions, along 
with the derived positional errors from the Muno et al.(2003; 
2006) X-ray source catalogues to set a maximum separation 
of 3.3" between and X-ray source and a star that can be 
considered a match. 



Of the 4256 X-ray sources in the Muno et al.(2003; 2006) 
catalogues, 3963 X-ray sources have positional errors of less 
than 3". From these, we found 3076 candidate counterparts 
to 1963 of the X-ray sources. There are 1368 X-ray sources 
with only one candidate counterpart, 341 with two candi- 
date counterparts, 151 with three and 103 with 4 or more 
candidate counterparts. Table [T] gives a breakdown of the 
numbers of counterparts for X-ray sources with error circles 
of 0" - 1", 1" - 2" and 2" - 3". As can be seen, for X- 
ray sources with positional errors of 0" — 1", the majority 
of these have only one candidate counterpart. Those with 
1" — 2" errors have a slightly larger proportion with 2 can- 
didate counterparts. As might be expected, the sources with 
the larger error circles, 2" — 3", have a much larger propor- 
tion with multiple candidate counterparts. 

Figure 20 shows colour magnitude diagrams (CMDs) 
of the candidate counterparts compared to a representative 
sample of Nuclear Bulge stars from the UKIDSS DR2 cat- 
alogue. The plots include both UKIDSS data and 2MASS 
data, the latter being used at bright magnitudes where stars 
may be saturated in the UKIDSS GPS. The saturation lim- 
its in the GPS vary depending on the sky background and 
seeing. Conservatively it is best to assume that sources with 
m_ff<12.0, m_f/<12.75 or mj<13.25 may be saturated al- 
though the typical saturation limits are ~0.5 magnitudes 
brighter than this. The saturation limit for individual fields 
can be determined with the ppErrbits parameters in the 
WSA (see §A2) but here we used the above thresholds for 
simplicity. 

From the diagrams it can be seen that the X-ray sources 
follow the general trend of the overall stellar distribution but 
with some differences to their specific distribution. In par- 
ticular, the CMDs containing J band data reveal a dispro- 
portionately high proportion of the candidate counterparts 
to be in the local, main sequence arm of the CMDs (see 
§3.2 for discussion and analysis of the CMDs of the Bulge). 
The majority of the remaining candidate counterparts in all 
three CMDs are consistent with red giants at the distance 
of the Bulge with variable levels of extinction. Figure 20 
also shows the X-ray source counterparts colour-coded by 
the X-ray hard colour taken from the Chandra catalogue 
(hard colour is defined as: (h — s)/{h + s), where h and s 
are the net counts in the 4.7-8.0 keV and 3.3-4.7 keV energy 
bands; Muno et al., 2006). Sources detected only in one of 
these two bands are plotted in black. As is clearly visible, 
the colours show that the X-ray sources with local, dwarf 
candidate counterparts have a higher proportion of sources 
with low values of colour, indicating a soft spectrum and/or 
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low absorption. The large proportion of single X-ray band 
detections (black points) in the local dwarf branch is mainly 
due to sources detected only in the 4.7 keV band, which is 
consistent with these being mostly soft X-ray sources. 

Based on the identification of candidate counterparts 
to these X-ray sources, we can draw some early conclusions 
as to the likely nature of the different populations of X-ray 
sources. We can separate the sources into those with local 
dwarf candidate counterparts and those with Bulge giants 
when analysing the populations. Wc noted above that the 
local dwarf candidate counterparts have a higher likelihood 
to be associated with X-ray sources with low values of hard 
colour, indicating cither an intrinsically soft spectrum or low 
absorption. Since local dwarfs make up only a small fraction 
of the total number of sources in this Nuclear Bulge field, 
the fact that they are disproportionately represented among 
the candidate X-ray source counterparts indicates that these 
arc almost certainly genuine matches, most likely CVs or X- 
ray active stars. For the X-ray sources with Nuclear Bulge 
giant counterparts, we see a generally higher value for the 
X-ray source colour indicating a harder or more absorbed 
X-ray spectrum in line with the higher level of absorption 
expected for the Nuclear Bulge X-ray source population. 
The very high stellar density (average separation of 2.8"in 
the GPS and 1.94" at K = 19; Goshng et al.2006) does 
mean that a fraction of the Bulge candidate counterparts 
will be chance positional coincidences, but the correlation of 
"hard" X-ray sources with Nuclear Bulge giant counterparts 
suggests that we are detecting true companions to many of 
the X-ray sources. The colour distribution of the candidate 
counterparts suggests the population to be predominantly 
Nuclear Bulge giants, an older, more evolved population. 
These systems make up the majority of the total population 
of X-ray sourcescounterparts. They are likely to be domi- 
nated by LMXBs with evolved companions. 

4.5 External galaxies in the Zone of Avoidance 

External galax:ies located close to the Galactic plane are 
hidden from view at optical wavelengths by the high extinc- 
tion. For this reason the Galactic plane is referred to as the 
"Zone of Avoidance" or "ZoA" in extragalactic studies. Lo- 
cating relatively nearby galaxies (within ~100 Mpc) in this 
region is of interest for studies of the structure and kine- 
matics of the Local Group and as well as nearby large-scale 
structures. This includes revealing the nature of the " Great 
Attractor", e.g. Dressier et al.(1987); Kolatt et al.(1995); 
Kraan-Korteweg & Lahav 2000, Schroder et al.(2007); Ko- 
ccvski et al.(2007); and the origin of the dipole in the Cos- 
mic Microwave Background. Near infrared surveys such as 
2MASS and DENIS have proved their effectiveness in pene- 
trating the ZoA (Jarrett et al. 2000b; Rousseau et al. 2000) 
provided the star density does not exceed a certain value 
(Kraan-Korteweg &: Jarrett 2005). The potential for such 
studies is greatly improved by the higher sensitivity and 
spatial resolution of the GPS. Furthermore, the star-galaxy 
separation is improved by the additional detection of the 
fainter galaxy disks. 

We noted in §3.2 that external galaxies form a dif- 
fuse but well defined group of spatially resolved sources 
(psteir<C 0.9) at the lower right of the colour-magnitude 
diagrams for fields in the outer galaxy {I > 90°). The pop- 



ulations of external galaxies in figures 9 to 13 are located 
at 1.5<(J-K)<3.0, tuk > 14. The galaxies appear redder in 
figures 9 to 11 than in figures 12 to 13, owing to the greater 
extinction along the line of sight at Z=98° than near Z=170° . 
We note that the vast majority of these galaxies are too faint 
to be detected in 2MASS. 

In colour-magnitude diagrams for the outer galaxy that 
employ the (J-K) colour (see figures 9 to 13) there is very 
little overlap between the stars classified as having "more 
reliable" photometry and the spatially resolved extragalac- 
tic sources, which are included in our "less reliable' cat- 
egory (marked with blue points). The paucity of point 
sources in the extragalactic group demonstrates that WF- 
CAM has spatially resolved the great majority of galaxies 
that are detected in the GPS. Inspection of the parameters 
in gpsSource for the objects in the region of the plots occu- 
pied by the extragalactic group shows that nearly all of them 
have pstar <g;0.9, pGalaxy>0.9 and mergedClass=+l. A 
significant fraction of the individual sources are not unam- 
biguously resolved in a casual visual inspection of the image 
data, but the classification in the WSA clearly benefits from 
the quantitative measurement of the observed spatial pro- 
file in all three passbands that is provided by the reduction 
pipeline. In the WSA, the nature of the profile is described 
by the parameter mergedClassStat, which describes the 
extent to which a source appear stellar, resolved, or under- 
sized (eg. a cosmic ray artifact affecting only 1 pixel). This 
parameter has a Normal distribution for sources in each ar- 
ray, with mean zero and variance unity. Large values corre- 
spond to resolved sources and negative values to under-sized 
objects. The mergedClassStat parameter for objects in 
the region of the plot occupied by the extragalactic group 
typically has a value between 2 and 10. 

We note that the classification of source image profiles 
as resolved or unresolved becomes unreliable near the sensi- 
tivity limits (m/f > 18). 

These results for the relatively uncrowded fields in the 
outer Galaxy are highly encouraging. However, the same 
cannot be said for much more crowded fields in the direc- 
tion of the inner Galaxy. We noted in §3.2 that sources classi- 
fied as spatially resolved (mergedClass=+l) are sometimes 
merely unresolved stellar pairs. Such pairs are very common 
in crowded fields. 

To test whether it is possible to identify external galax- 
ies in crowded fields using the source catalogues, we visually 
inspected a crowded field located at {I, b) — (30.2,1.2) The 
area of this field is 1 WFCAM tile (0.75 deg^). Visual inspec- 
tion of the 48 images (16 WFCAM arrays in 3 passbands) 
yielded 38 galaxies, 15 possible galaxies and one nebulous 
region. We refer to this selection as galaxies for brevity. The 
nebulous region is not listed in the source catalogues. Fifty 
two of the fifty three galaxies have mergedClass=+l, cor- 
responding to pGaleixy>0.9. By contrast, the field contains 
■~1.6 million unique sources, of which 18% have merged- 
Class=+1. 

The (J-K) vs.K colour-magnitude diagram for this field 
(not shown) shows no clear separation between the galax- 
ies and the sequence of reddened red clump giant stars in 
the field, even when the plot is restricted to sources with 
mergedClass=+l. The galaxies satisfy the constraints (J- 
K)> 1.7 and uik <16.5, with a further restriction to (J- 
K)> 1.9 if we include only definite galaxies from the visual 
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inspection. These cuts agree well with an intrinsic galajcy 
colour of (J-K) = 1.0, subjected to the (total) extinction in 
this particular field. 

Two further methods were used to attempt to sepa- 
rate the galaxies from stars: (i) the value of K_lPetroMag 
- K_lAperMag3 (i.e. a measure of resolved (Petrosian) mag- 
nitude minus point source magnitude); and (ii) cuts based 
on the parameter mergedClassStat. We found that both 
methods can be used to reduce the number of stars with 
mergedClass=+l and similar colours and magnitudes to 
the gala^xies. However, a cut based on either method requires 
us to throw away a significant fraction of the galaxies and 
the remaining contaminants still outnumber the the galaxies 
by a factor of at least 20. 

We conclude that whilst in the outer Galaxy exter- 
nal galaxies can be easily found using the parameters in 
the catalogue, in crowded fields they offer no real advan- 
tage over a laborious visual search of the images. Better 
results may be achieved in the inner Galaxy at latitudes fur- 
ther from the mid-plane (up to the survey limit at b=±5°), 
when these data become available. We note that the extinc- 
tion through the Galaxy (as defined by the DIRBE maps 
of Schlcgcl ct al.l998) and the amount of source confu- 
sion and reddening varies greatly oven within the 0.75 deg^ 
field (5.1<Av <21.6). Hence it may be possible to improve 
the discrimination between galaxies and red clump giant 
stars in future by adding a Galactic extinction parameter 
to gpsSource for each star (defined in a grid with a scale of 
a few arcminutes) and subtracting this from the measured 
extinction. 
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Figure 21. A combination of UKIDSS GPS and IPHAS 
data. This (i'-J) vs.(J-H) two colour diagram for the field at 
{I, 6)=(31,0) allows us to distinguish distant early type dwarfs (lo- 
cated on the lower sequence of reddened stars) from nearby late 
type dwarfs (the main clump at (i'-J)=2.5 and distant red clump 
giants (the upper sequence of reddened stars). Data for unred- 
dened stars from Hewett et al.(2006) are plotted at the lower left 
as filled symbols, linked by solid lines. The approximate location 
of unreddened main sequence spectral types is marked. Giants 
are redder than dwarfs in (J-H) at types K and M, leading to 
separate dwarf and giant sequences. 



4.6 Spectrophotometric typing with optical and 
infrared data from IPHAS and the GPS 

A limitation of the GPS data is that while the luminosity 
class of an individual field star can often be determined, it is 
not possible to estimate the spectral type in fields with sig- 
nificant reddening. This is because the reddening vector in 
the (H-K) vs. (J-H) two colour diagram runs almost parallel 
to the main sequence of stars with spectral types 0-K, in- 
troducing a degeneracy between reddening and the intrinsic 
colours of the photosphere. The approximate reddening for 
individual sources can be estimated using the (H-K) colour, 
but only by assuming the all photospheres have the same 
intrinsic colour. A more precise reddening value can be de- 
temined for YSOs by dereddening to the Classical T Tauri 
locus in the two colour diagram, as noted earlier, but the 
result is only valid for stars that are known to be YSOs. 

We anticipate that this degeneracy can be broken for 
many of the brighter sources in the GPS by using optical 
data from IPHAS survey (Drew et al.2005; Sale et al., in 
prep) . This survey covers the entire northern Galactic Plane 
down to Dec=-5° at —5 < 6 < 5 in the Sloan r', i' filters and 
an Ha filter. The survey is now almost complete and the 
IPHAS Initial Data Release (Gonzalez-Solares et al.2008) 
is now publically available, containing data from 1=27 to 
/=117. 

Work by members of the IPHAS team (Drew et al.2005) 
has shown that the IPHAS (r'-i') vs. (r'-Ha) two colour di- 
agram can be used to break the degeneracy with extinction 
and estimate the approximate spectral type and distance of 
most field stars. However, this requires detection in the r' 



and Ha filters, which is not usually achieved in fields with 
high reddening. Fortunately, i' fluxes will often be available 
for GPS sources even in fields with quite high reddening, 
since the IPHAS 5cr sensitivity limit is i' « 20.7 (Vega sys- 
tem). 

In figure 21 we show an example of an IPHAS/GPS 
(i'-J) vs. (J-H) two colour diagram for the field with high 
reddening at (l,b)={31,0) that was illustrated earlier in fig- 
ure 5. The "PhotoObjBest" IPHAS catalogue, which con- 
tains sources that were observed in good photometric con- 
ditions and detected in all 3 IPHAS passbands, was used 
for the cross match. The scatter in the diagram has been 
reduced using cuts very similar to the "most reliable, low 
completeness" cuts used for the near infrared two colour 
diagrams (sec §A3), including a restriction to sources with 
uncertainties <0.05 mag in i' and J. The two sequences of 
stars extending to the upper right of the diagram are dwarfs 
(lower sequence) and giants (upper sequence). The separa- 
tion between the two sequences is cleaner than in the two 
colour diagram in figure 5 because the separation in the (i'- 
J) colour is ~1.25 magnitudes, which is far larger than the 
typical photometric uncertainties in this colour. That is not 
the case for the (H-K) colour used in figure 5, even though we 
restricted the selection to sources with uncertainties listed 
as <0.05 mag on each axis. The fundamental reason for the 
separation remains the different intrinsic (J-H) colours of 
the two populations, which is due to the gravity sensitiv- 
ity of the ( J-H) colour in the giants and the earlier spectral 
types of the dwarfs. 

In addition to the improved dwarf/giant separation, a 
major benefit of figure 21 is the potential for spectropho- 
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tomctric typing. The sources on the reddened dwarf branch 
arc ahnost entirely distant early type stars (spectral types A 
and F). This can be seen by tracing the dwarf branch back to 
where it would intersect with the unreddened main sequence 
that is ovcrplottcd (using data from Hcwott ct al.2006). The 
location of the dwarf branch is consistent with either red- 
dened A-F type stars or M dwarfs. The latter can be ruled 
out since they are not detected at large enough distances to 
incur significant reddening. This interpretation is supported 
by a Besangon model of the field which confirms that this 
branch is composed of A and F type stars. (The model uses 
the Cousins I filter rather than Sloan i' so it is not exactly 
comparable and is therefore not shown). 

The reddened dwarf sequence can also be traced back to 
the lowest part of the main clump of stars near (i'-J)=2.5, 
as would be expected for these intrinsically blue, early type 
stars. The larger population of G and K type dwarfs is con- 
tained in the clump of apparently bluer stars near (i'-J)=2.5 
since these less luminous stars are not detected at sufficiently 
large distances to suffer such high reddening. The clump 
appears elongated from lower left to upper right, and is en- 
tirely consistent with a reddened population of G and K 
type main sequence stars that mostly have similar extinc- 
tion. This sequence is sufficiently inclined to the reddening 
vector to break the colour-reddening degeneracy and distin- 
guish G-type stars from K-type stars. The extinction toward 
stars in the G-K clump is Av=4-5 magnitudes and the im- 
plied distances arc typically ~l-2 kpc. The most distant red 
clump giants in the plot have AyRiQ magnitudes and im- 
plied distances of ~3.8 kpc. Stars that axe 1-2 magnitudes 
fainter in i' than those plotted, and therefore more distant, 
could be detected by (i) cross matching with the IPHAS 
"PliotoObj" catalogue rather than PhotoObjBcst; and (ii) 
including sources with larger photometric uncertainties. The 
PhotoObj catalogue does not require detection in all three 
IPHAS passbands for a star to be included. 
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APPENDIX A: GUIDE TO THE WFCAM 
SCIENCE ARCHIVE FOR GPS SCIENCE 

Al Fiindamentals of the GPS data, sirchive 
structure and usage 

In this guide to the CPS wo do not assume that the reader 
has previously used the WSA, nor any knowledge of SQL 
(the Structured Query Language which is used to con- 
struct and interrogate the WSA). There is extensive general 



UKIDSS documentation at the WSA and CASU websites 
so this guide focuses on information that is useful for GPS 
science. 

A 1.1 Multiframes: the basic unit of data 

WFCAM has four spatially separated 2048 x 2048 HAWAII 
2 infrared array detectors laid out in a 2 x 2 square pat- 
tern. Each array covers a 13.65x13.65 arcminute field of 
view and the gaps between them are 12.83 arcmin, or 94% 
of a detector width. A single observation therefore covers a 
partially filled area of sky known as a "multiframe" . In the 
GPS, a series of four slightly overlapping multiframes with 
offsets of 13.24 arcmin in RA and Dec is used to produced a 
filled "tile" with an area of 0.75 deg^. The multiframe (not 
the tile) is the fundamental unit around which the WSA is 
designed. Every multiframe has a reference number in the 
WSA. By contrast, the tiles are not catalogued. The WF- 
CAM arrays are always oriented approximately north-south 
(to within ~ 1°), so the GPS tiles the Plane in strips of 
constant Declination, always slightly overfilling the desired 
Galactic latitude boundaries with array corners and edges. 

The term multiframe is also applied to a coadded se- 
quence of observations with spatial offsets of a few arcsec 
(see A1.2). Every pipeline processed multiframe is stored in 
the WSA (both individual integrations and a coadded series 
of observations). However, only the coadded images have 
associated source catalogues (see A1.4). 

A 1.2 GPS observation sequence 

All CPS data are taken with a 2 x 2 "microstepping" se- 
quence in order to fully sample the spatial resolution of the 
image in the focal plane, which is frequently < 2 pixels Full 
Width Half Maximum, i.e. < 0.8 arcsec. The microsteps used 
in the GPS are spatial shifts of 4.6 arcsec, which is approx- 
imately 11.5 pixels (depending on location within the focal 
plane). This "N-l-0.5" microstepping allows the four separate 
images to be interleaved in order to produce images with a 
0.2 arcsec pixel scale. After going through the microsteps, 
the telescope is then moved through 3.2 arcsec (8 pixels) 
in RA and/or Dec before repeating the microstep sequence. 
This jitter procedure provides independent pixels to aid the 
removal of any bad pixels that were not included in the bad 
pixel mask. From DRl onward the number of jitters is two, 
so the observations comprise eight individual exposures of 
duration 10 s, 10 s and 5 s to make up total on source inte- 
gration times of 80 s, 80 s and 40 s in the J, H and K filters 
respectively. In the EDR four jitters were used for the J and 
H bandpass observations, with 5 s individual integrations. 
The number was reduced to two in order to improve observ- 
ing efficiency. Observations in the H2 filter used two jitters 
in all data releases, with eight individual exposures of 20 s 
making up the on source integration time of 160 s. 

A 1.3 GPS images in the WSA 

The multiframe images arc stored as FITS images with a 
header unit and four image extensions, one for each spa- 
tially separated array. There are three main types of reduced 
GPS multiframe image. (1) Individual integrations, which 
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are listed as normal multiframes and have a pixel scale of 
0.4 arcsec. (2) Interleaved images, listed as leav multiframes, 
which are formed from four images taken in the microstep- 
ping sequence and have a 0.2 arcsec pixel scale. (3) Final 
coaddcd images (also with a 0.2 arcsec pixel scale) that arc 
formed by coadding two or more interleaved images and are 
listed as leavstack multiframes in the WSA. For most pur- 
poses GPS users will only need to use the leavstack multi- 
frames. 

The user can make a list of the GPS multiframe images 
for download, and view high quality jpeg preview images, 
by selecting "Archive Listing" at the top level of the WSA 
and then selecting the GPS from the Programme drop down 
menu. The user will normally use the most recent UKIDSS 
release by default but there are also options to use either the 
earlier releases (which are a subset of the most recent release 
but can be searched slightly more quickly) or "WSA" in or- 
der to include images which failed quality control but may 
still be useful for some scientific purposes. The three types 
of reduced science image described above will be returned 
if the default settings of "object" in the Observation Type 
drop down menu and "all" in the Frame Type drop down 
menu arc used. If the latter menu is sot to "stack" then only 
leavstack images will be returned. The desired images can 
then be selected by date, by filter and by Equatorial coor- 
dinates (FK5 system, defined by a rectangular box). Two 
additional types of GPS image are the H2-K difference im- 
ages provided for the off-plane TAP region and confidence 
images, which exist for all leav and leavstack images and can 
be used to check for defects in the data. The difference im- 
ages are scaled by the filter width to remove sources with 
a flat continuum. The confidence images are only returned 
if the Observation Type is changed from "object" to "confi- 
dence" . Pixels in the confidence images have a median value 
of 100. Lower values correspond to either reduced sensitivity 
measured in the flat fields, bad pixels, hot pixels detected in 
the darks or saturated pixels. Other image types from the 
reduction process, such as Sky images and Dark images can 
also be obtained with the appropriate setting of the Obser- 
vation Type. Sky images are generally constructed using a 
sequence of frames taken throughout the night. These may 
be useful in order to assess the quality of the background 
subtraction in Galactic plane regions with bright nebulos- 
ity. 

The coordinates returned by the Archive Listing for a 
multiframe are those of the centre of the WFCAM focal 
plane, which lies between the four spatially separated arrays 
and is therefore a region with no data! Hence if the full array 
image nearest a particular location is desired, it is necessary 
to either use the Image tools described below to find the 
multiframe number or else search for all images in a box 
0.8° wide, centred on the target location, to find the desired 
one. 

More user friendly image access is provided by the 
"Gctlmagc" , "MultiGetimagc" and "Region Search" tools at 
the top level of the WSA, and via links from the gpsSource 
catalogue entries described below. The "Getlmage" tool al- 
lows the user to find and download cut-out images in all 
available filters, centred on the desired Galactic or Equa- 
torial coordinates. The image size is limited by the user- 
specified dimensions or by the boundaries of the 13.6 ar- 
cminute array field of view. There may be more than one 



image in each filter if the coordinates lie in the small over- 
lap region at the edge of an array, or if the field has been 
observed more than once. Second epoch observations have 
not yet occurred for the GPS, except when a field has been 
repeated by the observer to ensure the data quality satisfies 
the survey requirements. The "MultiGetimage" tool is simi- 
lar but it allows a large number of images to be downloaded 
in a single filter, each centred around a list of Equatorial 
coordinates provided by the user. The "Region Search" is 
described below. 

The UKIDSS astrometric solutions are determined by 
fitting to numerous 2MASS stars in the field of each sepa- 
rate science array, as noted in §2. The typical precision of 
the solution is 0.09 arcsec, as determined from the r.m.s. 
residuals to the fit, but there is some variation. The value of 
the r.m.s. residual for a given science detector array is given 
by the STDCRMS parameter. This parameter is given in 
the image headers for each array. It is also accessible via 
the CurrentAstrometry table in the WSA, which can be 
viewed using a FreeForm SQL query (see §A1.4). 

A 1.4 GPS catalogues 

Most users will wish to use the SQL source catalogues to 
search for astronomical objects. gpsSource is the most com- 
monly used catalogue. It is the merged catalogue containing 
most of the important data for all sources detected in at 
least one of the J, H and K passbands. The pairing scheme 
starts with the shortest available wavelength (the J band if 
the data are available, or the H band if not) and searches 
for counterparts in the H band and then the K band that 
are located <1 arcsec from the J band detection. Sources 
are only merged as a single entry in gpsSource if each is 
the closest match to the other at all wavelengths. The epoch 
of observation of a star in gpsSource in each filter can be 
determined by clicking on the entry in the "getMFlink" col- 
umn in the table, which will provide 1x1 arcminute images 
centred on the star, with associated dates and times. The 
other main table, gpsDetection, contains all the informar 
tion for all sources detected in any passband but with no 
association between detections of the same object in dif- 
ferent filters. While gpsSource is more convenient to use, 
the caveats are: (i) it includes only a subset of the photo- 
metric aperture sizes that are available in gpsDetection or 
the FITS binary catalogues; and (ii) in releases from DR3 
onward, most of the extragalactic fiux extimates such as 
Petrosian magnitude are no included. The details of how to 
write an SQL query to extract these missing attributes from 
the gpsDetection table are given in the "SQL Cookbook" 
at the top level of the WSA. 

The available columns or "attributes" in each table are 
seen when conducting a Menu Query (see below), which op- 
tionally provides a one line description of each attribute. 
We recommend that the user reads the more detailed en- 
tries for the gpsSource and gpsDetection parameters in 
the "Schema Browser" (accessible from the WSA top level) 
under "WSA UKIDSS" and then "Tables". 

The GPS SQL catalogues can be searched using the Re- 
gion Search, the Menu Query (recommended for new users) 
or the Freeform SQL page, all of which are accessible from 
the top level of the WSA. When using these tools, users 
of the DR2 release and subsequent releases have an op- 
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tion (in the drop down menu of tables) to view a subset 
of gpsSource which includes only areas which are com- 
plete in all three filters. This can slightly improve search 
speeds where the desired area is known to be complete (e.g. 
from an Archive Listing) . If previous data releases arc being 
searched, the user can similarly search a dataset with full fil- 
ter coverage by selecting e.g. the DRl database, as opposed 
to the larger DRIPLUS database. 

The simplest way to search the gpsSource table is with 
the Region Search. This enables a simple search of a circular 
area around the desired Equatorial or Galactic coodinates 
and it returns all information in the table for every object 
therein. In addition, the sources listed can bo clicked on 
to provide jpeg and FITS format cut-out images around 
their location, in all available filters (similar to the Getlmage 
results). 

More flexible and sophisticated queries can be made 
with the Menu Query or a FrccForm SQL query. Menu 
Queries are constructed by clicking boxes. They return only 
the desired attributes and restrict their values (e.g. magni- 
tudes, colours, coordinates etc.) in order to refine a search. 
This is especially important for the GPS because a maxi- 
mum of only 15 million data entries can be returned in a 
single query (e.g. 1.5 million sources with 10 data attributes 
each), which can quickly be exceeded when searching more 
than a square degree in crowded fields. The gpsSource at- 
tributes that are highlighted in the Menu Query list have 
their own indices in the table, which means that searches 
based on these parameters will proceed much more rapidly 
than those which do not. Hence searches based on e.g. spec- 
ified coordinates (Equatorial or Galactic) and a range of 
magnitudes will proceed rapidly if the prferred photomet- 
ric aperture (AperMagS, sec below) is used, but will take 
much longer if a different aperture size is used. Again, this 
is a serious issue since the size of the GPS catalogues means 
that searches can often time out (the maximum duration 
is presently 80 minutes) when searches arc based on non- 
highlighted attributes. Merely including non-highlighted at- 
tributes among the data to be returned (as opposed to spec- 
ifying a range of values) does not cause this problem. 

FrccForm SQL queries can perform the same tasks as 
a Menu Query and also manipulate the data returned from 
the table or operate on two or more tables (e.g. GPS data 
and Spitzer GLIMPSE data) to produce more complicated 
outputs. Menu Queries are transformed into ASCII format 
SQL queries before processing. A useful feature of the WSA 
is that this SQL query can be viewed using the "Show and 
Edit SQL" button during a Menu Query. This enables the 
user to learn how to perform basic FrccForm SQL queries 
with no prior knowledge of the language. The SQL Cook- 
book provides further training. 

A list of FITS binary source catalogues associated with 
individual coadded multiframe images (leavstacks) is pro- 
vided along with the image lists described in §A1.3 in an 
Archive Listing. They may be viewed with freely available 
software such as FV (FitsViewer). These catalogues contain 
only single filter data and source fluxes within the various 
apertures are given in data numbers (normalised to the ex- 
posure time of individual frames on sky) rather than mag- 
nitudes. The catalogue headers include the aperture correc- 
tions, photometric zero points, airmass and the exposure 
times (for the individual frames on sky), so it is straight- 



forward to calculate magnitudes or to conduct additional 
photometry, e.g. on very faint sources which may have been 
missed by the source detection algorithm and are therefore 
not in the source catalogues. We note that the FITS bi- 
nary catalogues have less precise R.A. and Dec coordinates 
than the SQL source catalogues, owing to a limitation on 
the number of data bits which corresponds to a precision 
of ~0.36 arcsec. However, the exact coordinates can be re- 
covered using the precise X, Y array coordinates given for 
each source and the WCS information in the catalogue FITS 
headers. Details of the columns in the FITS binary cata- 
logues (which provide additional information relevant to the 
SQL source catalogues) are available from the CASU website 
(Irwin et al.2005) and in Irwin et al.(2008). 



A1.5 Simultaneous multi- aperture fitting photometry and 
AperMagS 

The WFCAM pipeline performs aperture photometry, as op- 
posed to profile fitting photometry. However, any overlap- 
ping apertures are simultaneously fitted and the fluxes in 
areas of overlap axe assigned to each target in proportion 
to the total flux within each aperture. This generally pro- 
duces good photometry even in the crowded GPS fields (see 
A3), though there will be CEises where the photometry is im- 
perfect, such as faint sources on the wings of a bright or 
saturated star. We note that profile fitting photometry may 
be included in the final processing of all UKIDSS data when 
the survey is finished, but tests with an algorithm designed 
for the irregular image profiles often generated by ground 
based microstepped data have shown no significant gain in 
precision. 

Photometry is output for a series of circular apertures, 
for which consecutive aperture rmmbcrs have a ratio of V2 
in diameter. The smallest aperture, AperMagl, has a diam- 
eter of 1 arcsec. The default aperture that is used for GPS 
point source colours in gpsSource is AperMagS, which has 
a diameter of 2 arcsec. Only AperMagS, AperMag4 and 
AperMagS and the much larger Hallmag aperture (which 
is a measure of the total flux from each source) are avail- 
able in gpsSource in the existing data releases (though all 
apertures arc available in gpsDetection). We recommend 
that AperMagS is used for most GPS science unless there is 
good reason to desire a smaller or larger aperture. For future 
UKIDSS releases it is planned to change the available aper- 
tures in gpsSource to AperMagl , AperMagS and Aper- 
Mag4, with the rationale that the smaller aperture yields 
more reliable photometry in the most crowded fields and 
the slightly larger aperture will sometimes yield more pre- 
cise photometry in the off-plane TAP region, where relar- 
tively poor seeing is accepted. 



A2 Limitations and unusual features of the GPS 
data 

Here we describe some peculiar features of the GPS data 
which the user may notice but whose origin may not be 
immediately obvious. 

• In many images the profiles of stars in the leav and 
leavstack images may appear to be slightly "speckled" with 
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departures from the smooth monotonic image profiles usu- 
ally seen in astronomical data. This is due to the changes 
in the natural seeing during the microstepping procedure 
referred to in A1.2, which can cause adjacent rows and 
columns to show significant differences in flux. Comparison 
of fields with and without this effect have shown that it has 
no significant effect on photometric precision. 

• Electronic cross talk between adjacent array read out 
channels causes the appearance of spurious sources located 
at multiples of 128 pixels (51.2 arcsec) away from all bright 
stars along the axis of array readout (which is oriented dif- 
ferently for each quadrant of an array). This is a general fea- 
ture of Hawaii 2 arrays that is most serious for pixels which 
are close to saturation (> 30000 DN) in the WSA images. 
Note that the separation 128N pixels becomes 256N pixels 
in the coadded and interleaved leavstack images. All data in 
the DR2 release benefit from a cross-talk removal procedure 
which has reduced the problem. Hovewer, the procedure is 
imperfect so some artifacts remain. This is particularly so in 
regions with spatially variable background, where a flaw in 
the procedure actually generated additional cross talk arti- 
facts that do not always lie 128N pixels from a bright star. 
Owing to this error some images of regions with largo gradi- 
ents in the background were removed from DR2, including 
several H band images near the Galactic Centre. The error 
will be fixed from DR3 onward and the lost images will be 
recovered. The archive also addresses the cross talk problem 
from the DR2 release onward, but iu the catalogues rather 
than the images. Possible cross talk artifacts are flagged in 
the data given in the ppErrbits columns in gpsSource 
and gpsDetection. This identifles almost all cross talk ar- 
tifacts at 128N pixels from a bright star by using the 2MASS 
Point Source Catalogue. Usage of the ppErrbits flags to 
remove suspect data is described in A3. 

The brighter artifacts can be seen to be spurious since 
they have positive counts on one side of the profile and neg- 
ative counts on the other (relative to the sky level). This 
is because the cross talk artifacts are related to the spatial 
derivative of the profile of the bright star that generated 
them, in the readout direction. Fainter stars can sometimes 
produce artifacts which look more like real stars (and gen- 
erally have similar colours). It is advisable to guard against 
this by checking the images for bright stars near objects 
of interest. Over-saturated stars produce laxger, obviously 
non-stellar artifacts with a ring-like structure. 

• Saturated stars have ^ 35000 counts in leavstack im- 
ages, but the centre of an oversaturated star has a lower 
count level (often below the sky background). This is a gen- 
eral feature of data taken with near infrared arrays iu the 
correlated double sampling mode (CDS). Saturated stars ex- 
ist in most GPS fields. They are usually included in the 
source catalogues but the fluxes given are not meaningful. 
Again, the ppErrbits column in the catalogues can be used 
to efficiently remove these bad data, in this case by requiring 
e.g. JppErrbits<65536 to ensure that a source is not sat- 
urated at J band. Conservatively, it is safer to use 2MASS 
at mK<12.0, mif<12.75, m,7<13.25, though the typical sat- 
uration limits are ~0.5 magnitudes brighter than this. The 
ppErrbits parameters can be used to find the precise limits 
for each field in each filter, using the threshold value quoted 
above. Heavily oversaturated stars cause multiple spurious 
detections in each filter (and in gpsSource), since the satu- 



rated region is larger than the size of the stellar image profile 
used by the pipeline source detection algorithm. These are 
often listed in the archive as resolved sources rather than 
stars (i.e. mergedclass=+l). 

• Photometric errors are generally under-estimated. The 
photometric errors given in magnitudes and colours are only 
notional internal errors (excluding calibration uncertainty) 
that are based on source counts in the apertures, the Pull- 
Width-Half-Maximum (FWHM) of the image and the stan- 
dard deviation of the background flux. A plot of errors vs. 
magnitude yields a single smooth curve for fields contained 
within a single WFCAM array. The WSA is similar to the 
2MASS Point Source Catalogue in this respect. The given er- 
ror systematically underestimates the true error by a factor 
of ~1.2, owing to the inter-pixel capacitance of the WFCAM 
arrays. This phenomenon reduces the noise estimate derived 
from a measure of the local standard deviation by this factor 
by causing inter-pixel correlation of the count values (Irwin 
ct al.2008). Hence sources with photometric uncertainties of 
0.20 magnitudes in the archive are in fact usually closer to 
4(7 detections than 5a detections. Additionally, errors spe- 
cific to individual sources are not included in the error bud- 
get: e.g. bad pixels, the presence of adjacent stars or cross 
talk artifacts, or small scale spatial variations in sky back- 
ground. Finally, the CASU pipeline is designed to deliver 
an absolute photometric precision of ~0.02 mag for sources 
with a high signal to noise ratio. (The uncertainty in the 
photometric calibration is discussed below). Therefore users 
should not take seriously the millimagnitude uncertainties 
that are quoted for bright stars. When studying a particu- 
lar source users should inspect the image data (and perhaps 
also the confidence Images) to be sure that the given error 
is meaningful. 

• Nebulosity can also lead to photometric errors, since 
the sky background is evaluated with a relatively coarse grid 

in each field (typically in 25 arcsec grid cells). A more so- 
phisticated method of measuring the background on smaller 
spatial scales in nebulous regions has been developed by 
M.Irwin in order to improve the photometry but it will not 
be implemented in the WSA until DR5. More seriously, the 
source detection algorithm fails in regions of very bright neb- 
ulosity, since the nebula is interpreted as a single extended 
source. At present this effect lias been seen only in parts 
of the M17 star formation region (see §4.2) which is one of 
the largest regions of bright nebulosity in the sky. In such 
regions, it is necessary at present for users to do their own 
photometry (see §4.2) 

Finally, we provide the extinction dependent 2MASS- 
WFCAM photometric transformations that are used to gen- 
erate the photometric zero points for each detector, as noted 
in §2. 

J = J2M- 0.065(J2M - H2m) + O.Q15E{B - V) 

H = H2M + 0.07(J2M - H2m) + 0.005£'(S -V)- 0.03 

K = K2M + 0.01(J2M - K2m) + 0.005E{B - V) 

The E{B — V) colour excesses are computed using 
the prescription of Bonifacio et al.(2000), from the data of 
Schlegel et al. (1998). Since the extinction terms are small 
{E{B — V)=3 corresponds to 9 magnitudes of visual ex- 
tinction for a typical interstellar reddening law), differential 
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extinction across each detector is usually insufficient to con- 
tribute significantly to the uncertainty in the zero points, as 
measured by the internal scatter amongst the bright 2MASS 
sources that are used. (The uncertainties in the zero points 
are included in the image headers and the FITS binary cata- 
logue headers.) However, a small number of fields exist where 
spatially variable extinction causes significantly increased 
uncertainty in the calculated zero points. This should be 
borne in mind if the data are used to investigate e.g. possi- 
ble spatial variations in the near infrared extinction law in 
star formation regions. 

A3 Optimising the results of GPS catalogue 
searches 

Here we present three different choices of catalogue param- 
eters that are likely to be useful to generate different types 
of source list, ranging from a complete but unreliable selec- 
tion of the stellar sources in a region to a highly reliable but 
seriously incomplete selection. A fourth option is to use re- 
liableGpsPointSource in a Freeform SQL query instead 
of gpsSource. This "VIEW" of the source table is simpler 
than the methods described below but less effective. Details 
of its contents are available under "Schema Browser" at the 
WSA. 

Generating a complete list of unique Galactic sources is 
a little more complicated than simply returning all sources 
in a region of the sky. This is for two reasons. (1) gpsSource 
contains sources classified as "noise", via the attribute 
"mergedClass =0", in addition to the more numerous real 
sources which are variously classified as stars (merged- 
Class =—1), probable stars (mergedClass =—2), galaxies 
(mergedClass=-|-l) and probable galaxies (mergedClass 
=—3), on the basis of the image profile. The UKIDSS philos- 
ophy is to include in the archive all data that might be use- 
ful. However, we recommend that sources classified as noise 
be excluded from GPS searches, since most of them are not 
real sources, and they would inflate the star counts in any 
given field by ~10%. In general this will lead to a more accu- 
rate star count in any given field than a search on all types 
of source, at the price of excluding only a very small num- 
ber of real sources. By contrast, it is necessary to include 
"galaxies" and "probable galaxies" in a complete search, 
since many of these (in fact most of them in crowded fields) 
are marginally resolved pairs of stars which have been mis- 
classified in gpsSource. (2) Sources located in the regions 
of overlap between different arrays in a WFCAM tile are 
duplicated in the WSA. These duplicates can be identified 
and removed using the PriOrSec attribute, which describes 
Primary or Secondary detections (see Hambly et al.2008). 
This consideration also affects ~10% of sources. 

An illustrative SQL query to return a selection of 
useful attributes for the complete list of real unique sources 
located near / = 31, 6 = is given below. Comment lines in 
the query begin with "/*" and end with 

SELECT sourcelD, ra, dec, 1, b, jmhPnt, 
jmhPntErr, hmk_lPnt, hmk_lPntErr, 
mergedClass, pStar, jAperMagS, 
j AperMagSErr , hAperMagS, hAperMagSErr , 
k_l AperMagS , k_l AperMagSErr 

FROM gpsSource 



WHERE I BETWEEN 30.9 AND 31.1 
AND b BETWEEN -0 . 1 AND . 1 

/* Exclude detections classified as noise*/ 

AND mergedClass != 
/* Exclude multiple detections of the same 
source /* 

AND (PriOrSec=0 or PriDrSec=f ramesetID) 

It should be noted that this query must be typed or 
copied into the Freeform SQL interface, since the last line 
cannot be entered with the simple mathematical operators 
available in the Menu Query interface. The two colour dia- 
gram for the complete search at these coordinates is shown 
in figure Al(a). 

It is frequently desirable to restrict the sample to 
sources with small photometric errors, in order to identify 
the main stellar populations more easily in two colour and 
colour magnitude diagrams. A restricted search can be done 
entirely with the WSA but since the user will often wish 
to vary the search parameters while investigating the data 
it is often preferable to download the results of a complete 
search and then alter the parameters interactively. This can 
be done by loading the data into a suitable software pack- 
age. We strongly recommend TOPCAT, a user friendly soft- 
ware package designed to manipulate and display data ta- 
bles written in a variety of Virtual Observatory and text 
formats0. 

The following SQL query returns all the parameters 
needed to construct a more reliable but less complete 
photometric sample: 

SELECT sourcelD, ra, dec, 1, b, jmhPnt, 
jmhPntErr, hmk_lPnt, hmk_lPntErr, 
mergedClass, pStar, jAperMag3, 
jAperMag3Err , jEll, jppErrBits, jXi, jEta, 
hAperMag3, hAperMagSErr, hEll, hppErrBits, 
hXi, hEta, k_lAperMag3, k_lAperMag3Err , 
k_lEll, k_lppErrBits, k_lXi, k_lEta 

FROM gpsSource 

WHERE I BETWEEN 30.9 AND 31.1 
AND b BETWEEN -0 . 1 AND . 1 

/* Exclude noise detections */ 

AND mergedClass != 
/* Exclude multiple detections of the same 
source /* 

AND (PriOrSec=0 or PriDrSec=f ramesetID) 

Figure Al(b) is the two colour diagram for the sub- 
set of this download with errors < 0.05 mag on each 
axis. This was defined in TOPCAT by the expression 
"abs(jmhPntErr)<0.05 && abs(hmk_lPntErr)<0.05", 
where "abs" is the modulus operator and "&&" is the 
logical operator AND. (We note that column numbers 
can be used in TOPCAT instead of parameter names for 
brevity). The majority of the sources are located in a band 
running from the lower left to the upper right, caused by 
the reddening of the main sequence and giant star popula- 
tions. However, a large number of photometric outliers are 
apparent and some sources lie outside the boundaries of 



* TOPCAT is available from www.starlink.ac.uk/topcat 
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Figure Al. (J-H) vs.(H-K) two colour diagrams for a 0.04 deg^ area at (/,6)=(31,0) illustrate the trade off between completeness and 
reliability, (a) Complete selection of 51780 sources, excluding only "noise" sources, (b) Selection of the 8183 sources from plot (a) with 
errors <0.05 mag in each colour, (c) Selection of 5914 sources with colour errors <0.05 mag, stellar image profiles, and more reliable 
photometry, (d) Selection of 2787 sources with colour errors <0.05 mag, stellar image profiles, and the most reliable photometry. 



the plot. A smaller subsamplc with better photometry and 
> 50% completeness can be selected from the download 
with the following additional cuts (in TOPCAT format): 

pstar >0.99 && jppErrbits <256 && hppErrbits 
<256 kk k_lppErrbits <256 

The "pstar" parameter is the WSA estimate of the 
probability that the source is a star, as opposed to a galaxy, 
or noise. pstar=0.9 is the minimum value for a source to be 
classified as a star (mergedClass=-l), as opposed to e.g. 
a probable star or galaxy. In practice this selection helps to 
remove close stellar pairs that are resolved in only one or 
two of the three passbands. The "ppErrbits" parameters 
in each waveband can be used to identify sources which arc 
either cross talk artifacts (see A2) or stars with less reliable 
photometry due to saturation, deblending, bad pixels or lo- 
cation at the extreme edge of the detector. The ppErrbits 



parameters arc 16 byte integers (values from to 65535). 
The least significant byte (ppErrbits<256) contains infor- 
mation on minor issues that are unlikely to greatly affect the 
photometry: bad pixels within the 2 arcsec ApermagS aper- 
ture and whether the sources is doblonded. The more signif- 
cant bytes are populated (i.e.ppErrbits>256) if there are 
more serious issues such as saturation, cross talk artefacts, 
or loaction within one jitter position of an array boundary. 
A fuller description is given in the WSA. Wo note that the 
alternative "psat" parameter for saturation probability is 
not functional at present in the WSA. The corresponding 
plot in figure Al(c) is much cleaner: most of the unresolved 
stellar pairs have been removed by the pstar selection and 
numerous saturated stars and their associated spurious mul- 
tiple detections have been removed by the ppErrbits se- 
lection. In this case the completeness of the sample is 72%, 
relative to figure Al(b). The vertical separation between the 
reddened populations of dwaxfs and giant is now readily ap- 
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parent at (J-H)>1.75. The giants have ~0.25 mag larger 
( J-H) colours than the dwarfs. 

A sample with still more reliable photometry, but with 
lower completeness, can be defined as follows: 

pstar > 0.99 && jppErrbits < 256 && hppErrbits 

< 256 && k_lppErrbits < 256 kk 

sqrt(hXi*hXi + hEta*liEta)<0.3 kk 

sqrt (k_lXi*k_lXi + k_lEta*k_lEta)<0.3 kk 

jElKO.2 kk hElKO.2 kk k_lEll<0.2 

This is our "most reliable, low completeness" selection 

used in the two colour diagrams earlier in §3. The com- 
pleteness of this selection is also typically > 50% in the loss 
crowded parts of the GPS but is usually only 25-50% for 
fields near the mid-plane in the first quadrant. The "Ell" 
parameters give the ellipticity, which can be used to remove 
unresolved stellar pairs in a similar manner to pstar. The 
"Xi" and "Eta" parameters arc the distances in arcscconds 
of each detection from the shortest wavelength detection, 
in RA and Dec respectively. The maximum pair matching 
radius is 1 arcsec for the GPS, but since the typical sepa- 
ration of genuine matches is < 0.2 arcsec, these parameters 
can also be used to eliminate unresolved binaries and other 
false matches. Figure Al(d) shows the corresponding plot, 
which is now only 34% complete relative to figure Al(b). 
Only a few sources with poor photometry now remain. 
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